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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 

THEREOF 
FIELD OF INVENTION 
The present invention relates to Chlamydia antigens and corresponding DNA 
molecules, which can be used in methods to prevent and treat Chlamydia infection in 
mammals, such as humans. 

BACKGROUND OF THE INVENTION 
Chlamydiae are prokaryotes. They exhibit morphologic and structural similarities to 
gram-negative bacteria including a trilaminar outer membrane, which contains 
lipopolysaccharide and several membrane proteins that are structurally and functionally 
analogous to proteins found in E coli. They are obligate intra-cellular parasites with a unique 
biphasic life cycle consisting of a metabolically inactive but infectious extracellular stage and 
a replicating but non-infectious intracellular stage. The replicative stage of the life-cycle 
takes place within a membrane-bound inclusion which sequesters the bacteria away from the 

_cytoplasm.ofthe_infected.host cell _. 

C. pneumoniae is a common human pathogen, originally described as the TWAR 
strain of Chlamydia psittaci but subsequently recognised to be a new species. C. pneumoniae 
is antigenieally, genetically and morphologically^-distinct from other chlamydia species (G^— 



m 20 



25 



30 



trachomatis, C. pecorum and C. psittaci). It shows 10% or less DNA sequence homology 
with either of ^trachomatis or C. psittaci and so far appears to consist of only a single strain, 
TWAR. 

C. pneumoniae is a common cause of community acquired pneumonia, only less 
frequent than Streptococcus pneumoniae and Mycoplasma pneumoniae (Ref 1,2). It can also 
cause upper respiratory tract symptoms and disease, including bronchitis and sinusitis (Ref 
1,3,4,5). The great majority of the adult population (over 60%) has antibodies to C. 
pneumoniae (Ref 5), indicating past infection which was unrecognized or asymptomatic. 

Of considerable importance is the association of atherosclerosis and C pneumoniae 
infection. There are several epidemiological studies showing a correlation of previous 
infections with C. pneumoniae and heart attacks, coronary artery and carotid artery disease 
(Ref 6-10). Moreover, the organisms has been detected in atheromas and fatty streaks of the 
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coronary, carotid, peripheral arteries and aorta (Ref 11-15). Viable C. pneumoniae has been 
recovered from the coronary and carotid artery (Ref 16,17). Furthermore, it has been shown 
that C, pneumoniae can induce changes of atherosclerosis in a rabbit model (Ref 18). Taken 
together, these results indicate that it is highly probable that C. pneumoniae can cause 
atherosclerosis in humans,** though, , the epidemiological .» impqrtance^ of chlamydial 
atherosclerosis remains*to be^demonstrated; 

A number* of recent studies have also indicated -anvassociation between«C. pneumoniae 
infection and*asthma. Infection-has4>een M adult-onset 
asthma and acute exacerbations of asthma in adults, and small-scale studies have shown that 
prolonged antibiotic treatment was effective at greatly reducing the severity of the disease in 
some individuals (Ref 19-24). 

In light of these results a protective vaccine against C. pneumoniae infection would be 
of considerable importance. There is not yet an effective vaccine for any human chlamydial 
infection. Nevertheless, studies with C trachomatis and C psittaci indicate that this is an 
attainable goal***, For*exainple|ilmieejr^ lung#infection with C. 

rracAomari^are^protected^from-infe^ 

Similarly, sheep, immunized with^inactivated C. psittaci were protected- from subsequent 
cMamydial-mduc^wabortioiK^ infections 
has been-associated ^with ETlil hiimunerTesponsesT-rparticularly, the^ induc t ion of v INFg 



120 producing CD4 ^T-cells*(Refr€7) . The*adopti^e^transfei^ clones to 

nude or SCIDymi<^**eonfei^ disease (Ref 

28,29), and in vivo depletion of CD4+ T cells exacerbated disease post-challenge (Ref 
30,31). However, the presence of "sufficiently high titres of neutralising antibody at mucosal 
surfaces can also exert a protective effect (Ref 32). 

25 The extent of antigenic variation within the species C. pneumoniae is not well 

characterised. Serovars of C. trachomatis are defined on the basis of antigenic variation in 
MOMP, but published C. pneumoniae MOMP gene sequences show no variation between 
several diverse isolates^ of the orgamsm*,(Rek*33-35).- RegionsMofc*the^protein known to be 
conserved in\othet*chlamydiakM^ pneumoniaen^R.eff&3,3A). One 

30 study has describM#a*sti^*of«C^^ 

weight, but ttoeT^ene#fof*this^^ of outer 
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membrane protein 2 from nine diverse isolates were also found to be invariant (Ref 16). The 
genes for HSP60 and HSP70 show little variation from other chlamydial species, as would be 
expected. The gene encoding a 76kDa antigen has been cloned from a single strain of C. 
pneumoniae. It has no significant similarity with other known chlamydial genes (Ref 4). 
5 Many antigens recognised by immune sera to C. pneumoniae are conserved across all 

chlamydiae, but 98kDa, 76 kDa and 54 kDa proteins may be C pneumoniae-sptcific (Ref 2, 
4, 36). Immunoblotting of isolates with sera from patients does show variation of blotting 
patterns between isolates, indicating that serotypes C. pneumoniae may exist (Ref 1,16). 
However, the results are potentially confounded by the infection status of the patients, since 
10 immunoblot profiles of a patient's sera change with time post-infection. An assessment of the 
number and relative frequency of any serotypes, and the defining antigens, is not yet possible. 

C. pneumoniae infection usually presents as an acute respiratory disease (i.e., cough, 
sore throat, hoarseness, and fever; abnormal chest sounds on auscultation). For most patients, 
the cough persists for 2 to 6 weeks, and recovery is slow. In approximately 10% of these 
15 cases, upper respiratory tract infection is followed by bronchitis or pneumonia. Furthermore, 

dwing-a. C.-pneumoniae-tpideim^ 

in about half of these pneumonia patients, particularly in the infirm and the elderly. As noted 
above, there is more and more evidence that C. pneumoniae infection is also linked to diseases 
— other than respiratory infections. 



Ty 20 The reservoir for the organism is presumably people. In contrast to C. psittaci 

% infections, there is no known bird or animal reservoir. Transmission has not been clearly 
defined. It may result from direct contact with secretions, from formites, or from airborne 
spread. There is a long incubation period, which may last for many months. Based on 
analysis of epidemics, C. pneumoniae appears to spread slowly through a population (case-to- 
25 case interval averaging 30 days) because infected persons are inefficient transmitters of the 
organism. Susceptibility to C. pneumoniae is universal. Reinfections occur during adulthood, 
following the primary infection as a child. C. pneumoniae appears to be an endemic disease 
throughout the world, noteworthy for superimposed intervals of increased incidence 
(epidemics) that persist for 2 to 3 years. C.trachomatis infection does not confer cross- 
30 immunity to C. pneumoniae. Infections are easily treated with oral antibiotics, tetracycline or 
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erythromycin (2 g/d, for at least 10 to 14 d). A recently developed drug, azithromycin, is 
highly effective as a single-dose therapy against chlamydial infections. 

In most instances, C pneumoniae infection is often mild and without complications, 
and up to 90% of infections are subacute or unrecognized. Among children in industrialized 
5 countries, infections Jiave been*thought to be rare iip4o the age,pf 5 y, although#a recent study 
(E Normann et al, Chlamydia pneumoniae in children with acutewrespirato 
Acta Paediatrica, 1998y Vol 87, Iss 1, pp 23-27) has reported that-many; children in this^age 
group show PGR evidenee*of infection despite beingvseronegativef»and«estimates-a prevalence 
of 17-19% in 2-4 y olds. In developing countries, the seroprevalence of C. pneumoniae 
10 antibodies among young children is elevated, and there are suspicions that C. pneumoniae may 
be an important cause of acute lower respiratory tract disease and mortality for infants and 
children in tropical regions of the world, 
ijj From seroprevalence studies and studies of local epidemics, the initial C. pneumoniae 

l~! infection usually happens between the ages of 5 and 20 y. In the USA, for example, there are 
Q 15 estimated to be 3O ? 0O0*easfcs*©f-cMdhPfo^^ pneumoniae. 

O Infections may clustei^among^gren^ 

Iff military conscripts). 

;\ C. pneumpMiaememsQsmlO ^ tract 

iy20 pneumonia infection^mayiraceount^for^SO to%60^%>ipf^ 

IS periods, also, more^episodes^qf ^mixed«infections^withr5c- pneummiQ&hW£&ttn*reported. 

Reinfection during adulthood is common; the clinical presentation tends to be milder. Based on 
population seroprevalence studies, there tends to be increased exposure with age; which is 
particularly evident among men. Some investigators have speculated that a persistent, 

25 asymptomatic C. pneumoniae infection state is common. 

In adults of middle age or older, C. pneumoniae infection may progress to chronic 
bronchitis and sinusitis. A study in the USA revealed that the incidence of pneumonia caused 
by C. pneumoniae in persons, youngej than^60* years Js. 1 case<«perJL ,000-pepspns per year; but 
in the elderly, thelkdisease^inddenee^rose^three^fold^C pneumoniaeMMe^m&Arety leads 40 

30 hospitalization, exeept#in«patdente«witta 
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SUMMARY OF THE INVENTION 

The present invention provides purified and isolated DNA molecules that encode 
Chlamydia polypeptides designated CPN100626 (SEQ ID No: 1,2), which can be used in 
methods to prevent, treat, and diagnose Chlamydia infection. The encoded polypeptides 
include polypeptides having the amino acid sequence shown in SEQ ID No: 3 . Those skilled in 
the art will appreciate that the invention also includes DNA molecules that encode mutants and 
derivatives of such polypeptides, which result from the addition, deletion, or substitution of 
non-essential amino acids as described herein. The invention also includes RNA molecules 
corresponding to the DNA molecules of the invention. 

In addition to the DNA and RNA molecules, the invention includes the 
corresponding polypeptides and monospecific antibodies that specifically bind to such 
polypeptides. 

The present invention has wide application and includes expression cassettes, 
vectors, and cells transformed or transfected with the polynucleotides of the invention. 
Accordingly, the present invention provides (i) a method for producing a polypeptide of the 
invention in a recombinant -host-system and -related expression _ cassettes, vectors, and 
transformed or transfected cells; (ii) a live vaccine vector, such as a pox virus, Salmonella 
typhimurium, or Vibrio cholerae vector, containing a polynucleotide of the invention, such 
^ vaccine vectors b e ing usefel for, e.g., prev en ting a nd—tre ati ng Chlainydi<^ -mfcc^^=m= 



!y 20 combination with a diluent or carrier, and related pharmaceutical compositions and associated 
*2 therapeutic and/or prophylactic methods; (iii) a therapeutic and/or prophylactic method 
involving administration of an RNA or DNA molecule of the invention, either in a naked form 
or formulated with a delivery vehicle, a polypeptide or combination of polypeptides, or a 
monospecific antibody of the invention, and related pharmaceutical compositions; (iv) a 
25 method for diagnosing the presence of Chlamydia in a biological sample, which can involve 
the use of a DNA or RNA molecule, a monospecific antibody, or a polypeptide of the 
invention; and (v) a method for purifying a polypeptide of the invention by antibody-based 
affinity chromatography. 

BRIEF DESCRIPTION OF THE DRAWINGS 
30 The present invention will be further understood from the following description 

with reference to the drawings, in which: 
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Figure 1 shows the nucleotide sequence of the CPN 100626 (SEQ ID No: 1 - entire 
sequence and SEQ ID No: 2 - coding sequence). The protein is encoded on the negative 
strand. 

Figure 2 shows the deduced amino acid sequence of the CPN 100626 protein from 
5 Chlamydia pneumonia^E^lBMo^ ). The protein4s*e^^ 

Figure 3 shows«*the*xestoiction enzyme**analysisrof the^gene^eneoding the *;C. 
pneumoniae*GPN 100626 gene* 

DETAILED DESgRlPTfrON^^ 
In the C pneumoniae genome, open reading frames (ORFs) encoding chlamydial 
10 polypeptides have been identified. These polypeptides include polypeptides permanently found 
in the bacterial membrane structure, polypeptides that are present in the external vicinity of the 
bacterial membrane, include polypeptides permanently found in the inclusion membrane 
m structure, polypeptides that are present in the external vicinity of the inclusion membrane, and 
2 polypeptides that are released into the cytoplasm of the infected cell. These polypeptides can 
H 1 5 be used in vaccination^methodsrfbr pre ventingc^d*treating4(^ 

O -According* to -^fo^ isolated 

m polynucleotides encoding the precursor and mature fonnsaof ^/amytf ^polypeptides. 

An isolated ^polynucleotide of the*invention*eneodes (i) a polypeptide having an 
llf amino acid^sequcncc lhaUis homologous to a Chlamydia ammo - acid, th e . Chlamydia amino acid - 
IU 20 sequence being selected^from*the* group eonsisting^f:* 
iJ (a) the amino acid sequences as show5n^_(SEQJD No: 3) 

The term "isolated polynucleotide" is defined as a polynucleotide removed from the 
environment in which it naturally occurs. For example, a naturally-occurring DNA molecule 
present in the genome of a living bacteria or as part of a gene bank is not isolated, but the 
25 same molecule separated from the remaining part of the bacterial genome, as a result of, e.g., 
a cloning event (amplification), is isolated. Typically, an isolated DNA molecule is free from 
DNA regions (e.g., coding regions) with which it is immediately contiguous at the 5 1 or 3' 
end, in the naturally oceun?ing^genome.vt Sueh^isolatedwpolj^eleQtides^could be part of a 
vector or a composition* and^stilMSe isolated*in that*such a veetor^or*eompQsitio%is*not part of 
30 its natural environmelntr - 
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A polynucleotide of the invention can be in the form of RNA or DNA (e.g., 
cDNA, genomic DNA, or synthetic DNA), or modifications or combinations thereof. The 
DNA can be double-stranded or single-stranded, and, if single-stranded, can be the coding 
strand or the non-coding (anti-sense) strand. The sequence that encodes a polypeptide of the 
5 invention as shown in SEQ ID NOs: 1 and 2, can be (a) the coding sequence as shown in SEQ 
ID NOs: 2 (b) a ribonucleotide sequence derived by transcription of (a) ; or (c) a different 
coding sequence; this latter, as a result of the redundancy or degeneracy of the genetic code, 
encodes the same polypeptides as the DNA molecules of which the nucleotide sequences are 
illustrated in SEQ ID NOs: 1 to 2. 
10 By "polypeptide" or "protein" is meant any chain of amino acids, regardless of 

length or post-translational modification (e.g., glycosylation or phosphorylation). Both terms 
are used interchangeably in the present application, 
i By "homologous amino acid sequence" is meant an amino acid sequence that differs 

! from an amino acid sequence shown in SEQ ID No: 3, only by one or more conservative 
lis amino acid substitutions, or by one or more non-conservative amino acid substitutions, 

\ - deletions, or- additions - located - at- positions at which __they__do. not__ destroy__ the specific 

i 

antigenicity of the polypeptide. 

Preferably, such a sequence is at least 75%, more preferably 80%, and most 
; preferably 90% identical to mi amino acid sequence shown in SEQ TD No: 3. = 
Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in SEQ ID No:3. By "amino acid 
sequence substantially identical" is meant a sequence that is at least 90%, preferably 95%, 
more preferably 97%, and most preferably 99% identical to an amino acid sequence of 
reference and that preferably differs from the sequence of reference, if at all, by a majority of 
25 conservative amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among amino 
acids of the same class. These classes include, for example, amino acids having uncharged 
polar side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids 
having basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side 
30 chains, such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, 
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such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan, and cysteine. 

Homology is typically measured using sequence analysis software (e.g., Sequence 
Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
5 Biotechnology^Center, 1710 University Avenue^ Madison, WI 53705}. Similar amino acid 
sequences are aligned to obtain the maximum degree of homology (i.e., identity). To this end, 
it may be necessary to artificially introduce gaps into the sequence. Once; the optimal 
alignment has been set up, the degree of homology {i.e., identity) is established by recording 
all of the positions in which the amino acids of both sequences are identical, relative to the 
10 total number of positions. 

Homologous polynucleotide sequences are defined in a similar way. Preferably, a 
homologous sequence is one that is at least 45%, more preferably 60%, and most preferably 
W 85% identical to (i) a coding sequence of SEQ ID NOs:l and 2. 

ill Polypeptides having a sequence homologous to one of the sequences shown in SEQ 

Hl5 ID NO: 3, include- natarally^ccurrm^ any other non- 

S naturally occurring variants that are aiialogous in tenns of antigenicity, to ^.a polypeptide having 

In a sequence as shown in SEQ ID NO: 3. 

As is knownan the art, an allelic? variant is an:altemate form of a polypeptide that is 
III characterized as having a s ubs titut io n, dele t io n, or ad dttion^o f on e or mor e am ino-aeida^-ter 
■|y 20 does not alter the bioldgigal^fiinetion ofrthfepolypepti^^ is meant the 

35 function of the polypeptide in the cells in which it naturally occurs ,^even if the function is not 
necessary for the growth or survival of the cells. For example, the biological function of a 
porin is to allow the entry into cells of compounds present in the extracellular medium. The 
biological function is distinct from the antigenic function. A polypeptide can have more than 
25 one biological function. 

Allelic variants are very common in nature. For example, a bacterial species, 
£ g fC. pneumoniae, is usually represented by a variety of strains that differ from each other 
by minor allelie^yariationsa<Indeedf a pol>^eptidevthat*fulfills ,.the^ saiHetebiological function in 
different strains*ean*have an amino racid sequence that&is not#identicalfinueaGh%of the" strains. 
3 o Such an allelic variation>^y*sbe* 
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Support for the use of allelic variants of polypeptide antigens comes from, e.g., 
studies of the Chlamydial MOMP antigen. The amino acid sequence of the MOMP varies 
from strain to strain, yet cross-strain antibody binding plus neutralization of infectivity occurs, 
indicating that the MOMP, when used as an immunogen, is tolerant of amino acid variations. 
5 Polynucleotides, e.g., DNA molecules, encoding allelic variants can easily be 

retrieved by polymerase chain reaction (PCR) amplification of genomic bacterial DNA 
extracted by conventional methods. This involves the use of synthetic oligonucleotide primers 
matching upstream and downstream of the 5' and 3' ends of the encoding domain. Suitable 
primers can be designed according to the nucleotide sequence information provided in SEQ ID 
10 NOs:l and 2. Typically, a primer can consist of 10 to 40, preferably 15 to 25 nucleotides. It 
may be also advantageous to select primers containing C and G nucleotides in a proportion 
sufficient to ensure efficient hybridization; e.g., an amount of C and G nucleotides of at least 
Ti 40% , preferably 50% of the total nucleotide amount. 

=jj Useful homologs that do not naturally occur can be designed using known methods 

^15 for identifying regions of an antigen that are likely to be tolerant of amino acid sequence 
□ ^hMgeTand/or-deletions7 -For example r sequences of-the antigen from different-species can be 
fi compared to identify conserved sequences. 

Polypeptide derivatives that are encoded by polynucleotides of the invention 
Z include, e.g., fragments, polypeptid es-faa ving larg e inte rnal deletio n s d eri v e d from full - kH gth= 
polypeptides, and fusion proteins. 

Polypeptide fragments of the invention can be derived from a polypeptide having a 
sequence homologous to any of the sequences shown in SEQ ID NO: 3, to the extent that the 
fragments retain the substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that remove a 
25 substantial part of the parent polypeptide, while retaining specific antigenicity. Generally, 
polypeptide derivatives should be about at least 12 amino acids in length to maintain 
antigenicity. Advantageously, they can be at least 20 amino acids, preferably at least 50 
amino acids, more preferably at least 75 airrino acids, and most preferably at least 100 amino 
acids in length. 
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Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using 
computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens 
having potential as surface-exposed, antigenic regions (Ref 37). 

Polypeptide fragments and polypeptides having large internal deletions can be used 
5 for revealing epitopes^that^are^otheFwise^masked in the^parenUpoly peptide -andrfthat may be of 
importance for inducing a protective T cell-dependent immune .responses Ddletions can also 
remove immunodominant regions of high variability among*strains. 

It is an accepted practice kin the^field of immunolQgy to use«fcagm§nts and variants 
of protein immunogens as vaccines, as all that is required to induce an immune response to a 
10 protein is a small (e.g., 8 to 10 amino acid) immunogenic region of the protein. This has been 
done for a number of vaccines against pathogens other than Chlamydia. For example, short 
synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine 
m mammary tumor virus, peptide containing 11 amino acids; (Ref 38), Semliki Forest virus, 
S peptide containing 16 amino acids (Ref 39), and canine parvovirus, 2 overlapping peptides, 
R 15 each containing 15 ammoj^ids.^ be effectd^e>vaceine antigens 

O against their respective pathogens 

i J; Polynucleotides encoding ^polypeptide fragments and- polypeptides having large 

internal deletions can be constructed using^standard methods*(Refe41^), for example, by PCR, 
b including invers^CR;»4>y^ the-cloned DN A mol e cule s, or J b y_ 

m 20 the method of Kunkel et al. >(Ref«42) biologiqaUmateriaUavailable^at Stratagene- 
2 A polypeptide^deri^ative can also be produced as a fusion polypeptide that contains 

a polypeptide or a polypeptide derivative of the invention fused, e.g., at the N- or C-terrmnal 
end, to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be 
easily obtained by translation of a genetic fusion, i.e., a hybrid gene. Vectors for expressing 
25 fusion polypeptides are commercially available, such as the pMal-c2 or pMal-p2 systems of 
New England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S- 
transferase system of Pharmacia, or the His-Tag system available from Novagen. These and 
other expression systems provide convenient means for further purification of polypeptides and 

derivatives of the invention. 
30 Another particular- example of msionfcpolypeptides includedrin indention includes a 

polypeptide or polypeptide' derivative*of the3inventiQn»fused«to a poly^eptidejihaving-'adjuvant 
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activity, such as, e.g., subunit B of either cholera toxin or E. coli heat-labile toxin. Several 
possibilities are can be used for achieving fusion. First, the polypeptide of the invention can 
be fused to the N-, or preferably, to the C-terminal end of the polypeptide having adjuvant 
activity. Second, a polypeptide fragment of the invention can be fused within the amino acid 
5 sequence of the polypeptide having adjuvant activity. 

As stated above, the polynucleotides of the invention encode Chlamydia 
polypeptides in precursor or mature form. They can also encode hybrid precursors containing 
heterologous signal peptides, which can mature into polypeptides of the invention. By 
"heterologous signal peptide" is meant a signal peptide that is not found in the naturally- 
10 occurring precursor of a polypeptide of the invention. 

A polynucleotide of the invention, having a homologous coding sequence, 
hybridizes, preferably under stringent conditions, to a polynucleotide having a sequence as 
m shown in SEQ ID NOs:l to 2. Hybridization procedures are, e.g., described in Ausubel et 
5 al. , (Ref 41), Silhavy et al. (Ref 43); Davis et al. (ref 44). Important parameters that can be 
B 15 considered for optimizing hybridization conditions are reflected in a formula that allows 
O calculation of a critical value, the melting temperature above which two complementary DNA 
w strands separate from each other Ref 45). This formula is as follows: Tm = 81 .5 + 0.5 x (% 
L G+C) + 1.6 log (positive ion concentration) - 0.6 x (% formamide). Under appropriate 
ill stringency conditions, hybridization temperature (Th) is approximately 20 to 40^C-20 
K20 25 °C, or, preferably 30 to 40°C below the calculated Tm. Those skilled in the art will 
i| understand that optimal temperature and salt conditions can be readily determined empirically 
in preliminary experiments using conventional procedures. 

For example, stringent conditions can be achieved, both for pre-hybridizing and 
hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 50% formamide 
25 or (ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC solution (1 M NaCl, O.l M sodium 
citrate (pH 7.0)). 

For polynucleotides containing 30 to 600 nucleotides, the above formula is used and 
then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions 
are defined by a Th that is 5 to 10° C below Tm. 
30 Hybridization conditions with oligonucleotides shorter than 20-30 bases do not 

exactly follow the rules set forth above. In such cases, the formula for calculating the Tm is 
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as follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% 
G+C would have an approximate Tm of 54 °C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or 
modifications or combinations thereof, can have various applications. For example, a DNA 
5 molecule can*be nised*(A)^in a* proc^sssf or^produeingrthe^ 

host system,- (ii) in the construction of vaccine vectors such as poxviruses? which are further 
used in methods and compositions for preventing and/or treaiuxg<Chlamydia infection, (iii) as a 
vaccine agent (as well as an RNA molecule), in a naked form or formulated^with a delivery 
vehicle and, (iv) in the construction of attenuated Chlamydia strains that can over-express a 
10 polynucleotide of the invention or express it in a non-toxic, mutated form. 

According to a second aspect of the invention, there is therefore provided (i) an 
expression cassette containing a DNA molecule of the invention placed under the control of the 
m elements required for expression, in particular under the control of an appropriate promoter; 
S (ii) an expression vector containing an expression cassette of the invention; (iii) a procaryotic 

R 15 or eucaryotic*eelbtt^ of ^ 

□ invention, as well* as *(iv)*a proeess^for^produeing^a polypeptide or pol^fDgptide derivative 
m encoded by a poLynucleotide^of the^invention^ which, involves^cultuping^a procaryotic or 
eucaryotic cen^txansfoimediior^tramfected^with«an exppsston*easse^^ of the 

if; invention? imder * condi t ioiis^ molecule^of 4he; invention and^= 
]U 20 recovering theienGoded-polypeptideioi^ 

£ A reeonibinant-expression^sy stem can JDte^^ eucaryotic 

hosts. Eucaryotic hosts include yeast cells (e.g.* Saccharomyces cerevisiae or Pichia 
pastoris), mammalian cells (e.g., COS1, NIH3T3, or JEG3 cells), arthropods cells (e.g., 
Spodoptera frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as 
25 E. coli is used. Bacterial and eucaryotic cells are available from a number of different sources 
to those skilled in the art, e.g., the American Type Culture Collection (ATCC; Rockville, 
Maryland). 

The^ehoi<^*of^the^^ the 
expressed polypeptide** For^example^it may«|be*useful»itp *produee*a polypeptide of the 

30 invention: in*aip^i^ 
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The choice of the expression cassette will depend on the host system selected as 
well as the features desired for the expressed polypeptide. Typically, an expression cassette 
includes a promoter that is functional in the selected host system and can be constitutive or 
inducible; a ribosome binding site; a start codon (ATG) if necessary, a region encoding a 
signal peptide, e.g., a lipidation signal peptide; a DNA molecule of the invention; a stop 
codon; and optionally a 3' terminal region (translation and/or transcription terminator). The 
signal peptide encoding region is adjacent to the polynucleotide of the invention and placed in 
proper reading frame. The signal peptide-encoding region can be homologous or 
heterologous to the DNA molecule encoding the mature polypeptide and can be specific to the 
secretion apparatus of the host used for expression. The open reading frame constituted by the 
DNA molecule of the invention, solely or together with the signal peptide, is placed under the 
control of the promoter so that transcription and translation occur in the host system. 
Promoters, signal peptide encoding regions are widely known and available to those skilled in 
the art and includes, for example, the promoter of Salmonella typhimurium (and derivatives) 
that is inducible by arabinose (promoter araB) and is functional in Gram-negative bacteria such 
as E. coli (as described in U.S. Patent No,- 5,028,530 and. in Cagnon et al., (Ref 46); the 
promoter of the gene of bacteriophage T7 encoding RNA polymerase, that is functional in a 
L number of E. coli strains expressing T7 polymerase (described in U.S. Patent No. 4,952,496); 
?^ OspA lipidaliuu signal peptide ; and RlpB lip i rifrtitm-Mgual peptide (Ref 47). 
jlj 20 The expression cassette is typically part of an expression vector, which is selected 

i| for its ability to replicate in the chosen expression system. Expression vectors (e.g., plasmids 
or viral vectors) can be chosen from those described in Pouwels et al. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). They can be purchased from various commercial 
sources. 

25 Methods for ti^fonning/transfecting host cells with expression vectors will 

depend on the host system selected as described in Ausubel et al., (Ref 41). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted in 
the extracellular medium or in the periplasmic space, or is embedded in the cellular 

30 membrane. The polypeptide can then be recovered in a substantially purified form from the 
cell extract or from the supernatant after centrifugation of the recombinant cell culture. 
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Typically, the recombinant polypeptide can be purified by antibody-based affinity purification 
or by any other method that can be readily adapted by a person skilled in the art, such as by 
genetic fusion to a small affinity binding domain. Antibody-based affinity purification 
methods are also available for purifying a polypeptide of the invention extracted from a 
5 Chlamydia strain* Antib6dies«useful for purifying-by -inmunoaffinity *the*poly!p5ptides .of the 
invention can be obtained as described below. 

A polynucleotide of the invention can also be useful in the vaccine field, e.g., for 
achieving DNA vaccination. There are two major possibilities, either-using a viral or bacterial 
host as gene delivery vehicle Give vaccine vector) or administering the gene in a free form, 
10 e.g., inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the 
invention can be evaluated as described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine 
n vector such as a poxvirus, containing a DNA molecule of the invention, placed under the 
J control of elements required for expression; (ii) a composition of matter containing a vaccine 
=! 15 vector of the invention^together. widwa- dUu^*©F*eanaer,^ 

composition containing a therapeutically or prophylactically, effeBtive^amount of a. vaccine 
vector of the invention; (iv) a memod for mducing*^ m a 

mammal (e.g., a minjant^teniata*e^ for 
iipgft/a^ditg^nfectioT^f-aniuials, . e.g., cats or birds), which involves- 

fy 20 administering to me *niaininaltan inimunogem^ vaceine*vector of the 

!| invention to elicit an immune-response, e.g., a proteetiye. or mecapejitie^immune response to 
Chlamydia ; and particularly, (v) a method for preventing and/or treating a Chlamydia (e.g., 
C. trachomatis, C. psittaci, C. pneumonia, C. pecorum) infection, which involves 
administering a prophylactic or therapeutic amount of a vaccine vector of the invention to an 
25 individual in need. Additionally, the third aspect of the invention encompasses the use of a 
vaccine vector of the invention in the preparation of a medicament for preventing and/or 
treating Chlamydia infection. 

A vaccme *veetor^of -4he*mve^ or 
derivatives of the mv.ention^as*well^^ 
30 homolog, mutantr or deri!»ative*there©fe*. limaO^imm0t -^m^^^J^1^-*-s^. *s 
interleukin-2 (IL-2) or m ter4eukin*l»2/(II>12)^^ 
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effect). Thus, a vaccine vector can include an additional DNA sequence encoding, e.g., a 
chlamydial antigen , or a cytokine, placed under the control of elements required for 

expression in a mammalian cell. 

Alternatively, a composition of the invention can include several vaccine vectors, 
each of them being capable of expressing a polypeptide or derivative of the invention. A 
composition can also contain a vaccine vector capable of expressing an additional Chlamydia 
antigen, or a subunit, fragment, homolog, mutant, or derivative thereof; or a cytokine such as 
IL-2 or EL-12. 

In vaccination methods for treating or preventing infection in a mammal, a vaccine 
vector of the invention can be administered by any conventional route in use in the vaccine 
field, particularly, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, 
rectal, vaginal, or urinary tract) surface or via the parenteral (e.g., subcutaneous, intradermal, 
intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice 
of the vaccine vector. The adininistration can be achieved in a single dose or repeated at 
intervals. The appropriate dosage depends on various parameters understood by skilled 
-artisans such as the vaccine veetor-itself,- the route of administration or the condition of the 
mammal to be vaccinated (weight, age and the like). 

Live vaccine vectors available in the art include viral vectors such as adenoviruses 
and poxviruses as well as bacterial vectors, e.g., Shigella, SalmoncUa^ibno choler a s 
Lactobacillus, Bacille bili6 de Calmette-Gu6rin (BCG), and Streptococcus. 

An example of an adenovirus vector, as well as a method for constructing an 
adenovirus vector capable of expressing a DNA molecule of the invention, are described in 
U.S. Patent No. 4,920,209. Poxvirus vectors that can be used include, e.g., vaccinia and 
canary pox virus, described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, 
respectively (also see, e.g., Tartaglia et al., Virology (1992) 188:217) for a description of a 
vaccinia virus vector; and Taylor et al, Vaccine (1995) 13:539 for a reference of a canary 
pox). Poxvirus vectors capable of expressing a polynucleotide of the invention can be 
obtained by homologous recombination as described in Kieny et al.. Nature (1984) 312:163 so 
that the polynucleotide of the invention is inserted in the viral genome under appropriate 
conditions for expression in mammalian cells. Generally, the dose of vaccine viral vector, for 
therapeutic or prophylactic use, can be of from about lxlO 4 to about 1x10", advantageously 
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from about IxlO 7 to about lxlO 10 , preferably of from about lxlO 7 to about lxlO 9 plaque- 
forming units per kilogram. Preferably, viral vectors are administered parenterally; for 
example, in 3 doses, 4 weeks apart. Those skilled in the art recognize that it is preferable to 
avoid adding a chemical adjuvant to a composition containing a viral vector of the invention 
and thereby minimizing the immune response^© the viral ^veetor itselfc 

Non-toxicogenic Vibrio cholerae mutant strains that are useful as a live oral vaccine 
are described in Mekalanos et ah, Nature (1983) 306:551 and U.S. Patent No, 4,882,278 
(strain in which a substantial amount of the coding sequence of each of the two ctxA alleles has 
been deleted so that no functional cholerae toxin is produced); WO 92/11354 (strain in which 
the irgA locus is inactivated by mutation; this mutation can be combined in a single strain with 
ctxA mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA 
sequences). These strains can be genetically engineered to express heterologous antigens, as 
described in WO 94/19482. An effective vaccine dose of a Vibrio cholerae strain capable of 
expressing a polypeptide or polypeptide derivative encoded by a DNA molecule of the 
invention can contain*^.*:, aboufclxlO 5 to about lxlO 9 , preferably about-lxlO 6 to about 1x10 s 
viable bacteria in an .appropriate volume for the selected- route of administration. Preferred 
routes of administration include ail mucosal ; routes; most preferably, these vectors are 
administered intranasaliy or orally . 
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expression of heterologous^ antigem^or^ m 
Nakayama et al. (Bio/Technology (1988) 6:693) and WQ 92/11361. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 

intranasaliy or orally. 

Others bacterial strains useful as vaccine vectors are described in High et a/., 
EMBO (1992) 11:1991 and Sizemore et al. y Science (1995) 270:299 (Shigella flexneri); 
Medaglini et aL, Proc. Natl. Acad. ScL USA (1995) 92:6868 (Streptococcus gordonif); and 
Flynn J.L., Cell. Mol. Biol. (1994) 40 (suppl. I):31, WO 88/6626, WO 90/0594, WO 
91/13157, WO 92/M96r and WQ 92/«21^ 

In bacterial vectors^ polynucleotide*^ *<the* inventions can^be ^inserted into the 
bacterial genome^or can*remahrin a free*state^cam^Qnva^plasmid* 
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An adjuvant can also be added to a composition containing a vaccine bacterial 
vector. A number of adjuvants are known to those skilled in the art. Preferred adjuvants can 
be selected from the list provided below. 

According to a fourth aspect of the invention, there is also provided (i) a 
5 composition of matter containing a polynucleotide of the invention, together with a diluent or 
carrier; (ii) a pharmaceutical composition containing a therapeutically or prophylactically 
effective amount of a polynucleotide of the invention; (iii) a method for inducing an immune 
response against Chlamydia, in a mammal, by atininistering to the mammal, an 
immunogenically effective amount of a polynucleotide of the invention to elicit an immune 
10 response, e. g. , a protective immune response to Chlamydia; and particularly, (iv) a method for 
preventing and/or treating a Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae, or 
C. pecorum) infection, by administering a prophylactic or therapeutic amount of a 
polynucleotide of the invention to an individual in need. Additionally, the fourth aspect of the 
invention encompasses the use of a polynucleotide of the invention in the preparation of a 
medicament for preventing and/or treating Chlamydia infection. The fourth aspect of the 
invention preferably includes the use of a DNA- molecule placed under conditions for 
expression in a mammalian cell, e.g., in a plasmid that is unable to replicate in mammalian 
cells and to substantially integrate in a mammalian genome. 

p a ymi deotid e s^BNA or RNA) of die invention can also b e adim nis tt^-as-s iK 
to a mammal for vaccine, e.g., therapeutic or prophylactic, purpose. When a DNA molecule 
of the invention is used, it can be in the form of a plasmid that is unable to replicate in a 
mammalian cell and unable to integrate in the mammalian genome. Typically, a DNA 
molecule is placed under the control of a promoter suitable for expression in a mammalian 
cell. The promoter can function ubiquitously or tissue-specifically. Examples of non-tissue 
25 specific promoters include the early Cytomegalovirus (CMV) promoter (described in U.S. 
Patent No. 4,168,062) and the Rous Sarcoma Virus promoter (described in Norton & Coffin, 
Molec. Cell Biol. (1985) 5:281). The desmin promoter (Li et al., Gene (1989) 78:243, Li & 
Paulin, J. Biol. Chem. (1991) 266:6562 and Li & Paulin, J. Biol. Chem. (1993) 268:10403) 
is tissue-specific and drives expression in muscle cells. More generally, useful vectors are 
30 described, La., WO 94/21797 and Hartikka et al., Human Gene Therapy (1996) 7:1205. 



ru 20 



17 



RY45 101998 



For DNA/RNA vaccination, the polynucleotide of the invention can encode a 
precursor or a mature form. When it encodes a precursor form, the precursor form can be 
homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, 
such as the leader sequence of the tissue-type plasminogen factor (tPA). 
5 A composition of the invention can contain one , or several polynucleotides of the 

invention. It can also contain at least one additional polynucleotide encoding another 
Chlamydia antigen such as urease subunit A, B, or both; or a fragment, derivative, mutant, or 
analog thereof. A polynucleotide encoding a cytokine, such as interleuJrin-2 (TL-2) or 
interleukin-12 (TL-12), can also be added to the composition so that the immune response is 
10 enhanced. These additional polynucleotides are placed under appropriate control for 
expression. Advantageously, DNA molecules of the invention and/or additional DNA 
molecules to be included in the same composition, can be carried in the same plasmid. 

Standard techniques of molecular biology for preparing and purifying 
polynucleotides can be used in the preparation of polynucleotide therapeutics of the invention, 
r, 15 For use as a vaeeine^a polynucleOtideV©f.me>mvent^ to various 

;3 methods. 

jjj First, a polynucleotide can be used in a naked form, free of any-delivery vehicles, 

such as anioniei. Jippsomes^ cataon.ic .lipids, mieEopartieles*- e.gvr gold microparticles, 
"p recipitating aagents,-g.g., calc i um phos p h atero r any oJi c r r Liaiisfo ct i o i ^ fa c ilitat i ng a gcnfc=fo= 
this case, the«fiotynueleotide-ean«be ,simply*jdUuted*ima physiologically, acceptable solution, 
such as sterttessalaKKff -sterate^ carriers When present, the 

carrier preferably is isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic 
strength, such as provided by a sucrose solution, e.g., a solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist in cellular 
25 uptake. It can be, i.e., (i) complemented with a chemical agent that modifies the cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (u) encapsulated into liposomes, 
or (iii) associated with cationic lipids or silica, gold, or tungsten microparticles. 

Anionicand^neutraUiposomes are well-known in the art (see, e.g., Liposomes: A 

Practical .Approaeh^ for 
30 making liposomes^and^^ including 

polynucleotides**. 
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Cationic lipids are also known in the art and are commonly used for gene delivery. 
Such lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3-dioleyloxy)propyl]- 
N^N-trimethylammomum chloride), DOTAP (l,2-bis(oleyloxy)-3- 

(trimethylammonio)propane), DDAB (dimethyldioctadecylammonium bromide), DOGS 
5 (dioctadecylamidologlycyl spermine) and cholesterol derivatives such as DC-Choi (3 beta-(N- 
(N\N' -dimethyl arninomethane)-carbamoyl) cholesterol). A description of these cationic 
lipids can be found in EP 187,702, WO 90/11092, U.S. Patent No. 5,283,185, WO 91/15501, 
WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids for gene delivery are 
preferably used in association with a neutral lipid such as DOPE (dioleyl 
10 phosphatidylethanolamine), as, for example, described in WO 90/11092. 

Other transfection-facilitating compounds can be added to a formulation containing 
cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, 
WO 94/25608, and WO 95/2397. They include, i.a. y spermine derivatives useful for 
facilitating the transport of DNA through the nuclear membrane (see, for example, 
1 15 WO 93/18759) and membrane-permeabilizing compounds such as GALA, Gramicidine S, and 
cationic bile salts (see, for example, WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as described in 
U WO 91/359, WO 93/17706, and Tang et aL (Nature (1992) 356:152). In this case, the 
?Z n rncropagt icle-coated polynucleotides can be injected via intradermal or intraepidermal rotites- 
W 20 using a needleless injection device ("gene gun"), such as those described in U.S. Patent 
W No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength 
of the promoter used in the DNA construct, the immunogenicity of the expressed gene 
product, the condition of the mammal intended for administration (e.g., the weight, age, and 
25 general health of the mammal), the mode of administration, and the type of formulation. In 
general, a therapeutically or prophylactically effective dose from about 1 fig to about 1 mg, 
preferably, from about 10 jig to about 800 /xg and, more preferably, from about 25 jig to about 
250 Mg> be administered to human adults. The administration can be achieved in a single 
dose or repeated at intervals. 
30 The route of administration can be any conventional route used in the vaccine field. 

As general guidance, a polynucleotide of the invention can be administered via a mucosal 
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surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, and urinary 
tract surface; or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, 
intradermal, intraepidermal, or intramuscular route. The choice of the administration route 
will depend on, e.g., the formulation that is selected. A polynucleotide formulated in 
5 association witfe^bupw/aeaine^is advantageously administered into, muscles. When a neutral or 
anionic liposome or a cationic lipid, such as DOTMA or DC-Choi, is used**the formulation 
can be advantageously injected via intravenous, intranasal (aerosolization), intramuscular-, 
intradermal, and- subcutaneous . routes.^ A polynucleotide^ a naked*form-can*advantageously 
be administered via the intramuscular, intradermal, or sub-cutaneous routes. 
10 Although not absolutely required, such a composition can also contain an adjuvant. 

If so, a systemic adjuvant that does not require concomitant administration in order to exhibit 
an adjuvant effect is preferable such as, e.g., QS21, which is described in U.S. Patent No. 
m 5,057,546. 

H The sequence information provided in the present application enables the design of 

H 15 specific nucleotide probes»ahd^primers«thaUean be usefuMn diagnosis. ^Accordingly, in a fifth 

Q aspect of the invention^there is provided a nucleotide probe *or primer-having -a sequence 

X found in or derived by degeneracy, of the genetic^code from a sequence shown in SEQ ID 
NO:l to 2, 

if I he^term^probe^as used^in d ie ^piescnt^appli^ tion r e fers -to D N A (preferably^ 

ft 20 single stranded) or RNA molecules (or modifications, or combinations hereof) that hybridize 
:§ under the stringent*QonditiQns?*as defined^abo.ve? to nucleic acid molecules 'having sequences 
homologous to those shown in SEQ ID NOs:l and 2, or to a complementary or anti-sense 
sequence. Generally, probes are significantly shorter than full-length sequences shown in SEQ 
ID NOs:l and 2; for example, they can contain from about 5 to about 100, preferably from 
25 about 10 to about 80 nucleotides. In particular, probes have sequences that are at least 75%, 
preferably at least 85%, more preferably 95% homologous to a portion of a sequence as 
shown in SEQ ID NOs:l and 2 or that are complementary to such sequences. Probes can 
contain modified bases such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino* 
5-deoxyuridine, or diamino-2, 6-purinev > Sugas or phosphate^residues ean-also*be modified or 
30 substituted. For example,** deoxyribose-riesidue»eanqbe replaced *y-a*polyanude (Nielsen et 
al, Science (1991) 254:1497)^d«pfc^^ 
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diphosphate, alkyl, arylphosphonate and phosphorothioate esters. In addition, the 2'-hydroxyl 
group on ribonucleotides can be modified by including, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, as capture or detection 
probes. Such capture probes can be conventionally immobilized on a solid support, directly or 
indirectly, by covalent means or by passive adsorption. A detection probe can be labelled by a 
detection marker selected from radioactive isotopes; enzymes such as peroxidase, alkaline 
phosphatase, and enzymes able to hydrolyze a chromogenic, fluorogenic, or luminescent 
substrate; compounds that are chromogenic, fluorogenic, or luminescent; nucleotide base 
analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization technique, 
such as dot blot (Maniatis et ah, Molecular Cloning: A Laboratory Manual (1982) Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York), Southern blot (Southern, J. 
Mol. Biol. (1975) 98:503), northern blot (identical to Southern blot to the exception that RNA 
is used as a target), or the sandwich technique (Dunn et al., Cell (1977) 12:23). The latter 
technique involves the use of a specific capture probe and/or a specific detection probe with 

© nucleotide-sequences-that-at-least-part^ - 

fi a primer is usually a probe of about 10 to about 40 nucleotides that is used to 

L initiate enzymatic polymerization of DNA in an amplification process (e.g., PCR), in an 
L; elongation process, or m a reverse transcription method. In a ^^ag^resto rethod involvin gs 
fU 20 PCR, primers can be labelled. 

;J Thus, the invention also encompasses (i) a reagent containing a probe of the 

invention for detecting and/or identifying the presence of Chlamydia in a biological material; 
(ii) a method for detecting and/or identifying the presence of Chlamydia in a biological 
material, in which (a) a sample is recovered or derived from the biological material, (b) DNA 
25 or RNA is extracted from the material and denatured, and (c) exposed to a probe of the 
invention, for example, a capture, detection probe or both, under stringent hybridization 
conditions, such that hybridization is detected; and (iii) a method for detecting and/or 
identifying the presence of Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA is extracted therefrom, (c) the 
30 extracted DNA is primed with at least one, and preferably two, primers of the invention and 
amplified by polymerase chain reaction, and (d) the amplified DNA fragment is produced. 
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As previously mentioned, polypeptides that can be produced upon expression of the 
newly identified open reading frames are useful vaccine agents. 

Therefore, a sixth aspect of the invention features a substantially purified 
polypeptide or polypeptide derivative having an amino acid sequence encoded by a 
5 polynucleotide»of,thesmvjenl>ion*s 

A ••substantiany^purjfied^polypeptide"' is defined-.as a polypeptide»tfaat. is separated 
from the environment in which it naturally occurs-and/or that-is free of-the-majority of the 
polypeptides that are present in the environment in which it was synthesized*: For example, a 
substantially purified polypeptide is free from cytoplasmic polypeptides. Those skilled in the 
10 art will understand that the polypeptides of the invention can be purified from a natural source, 
i.e. , a Chlamydia strain, or can be produced by recombinant means. 

Homologous polypeptides or polypeptide derivatives encoded by polynucleotides of 
01 the invention can be screened for specific antigenicity by testing cross-reactivity with an 
S antiserum raised against the polypeptide of reference having an amino acid sequence as shown 
H 15 in SEQ ID NOs:3-r^ Briefly^ monospeeifieihypja^^ against a 

□ purified reference polypep^dejas,sueh:'Or ,as.a fusipn**polyipeptide.>.for examplg*ian expression 
j}j product of MBP, GST, or ,His 5 tag*systems*or^i syntheti^peptide^prgdiGtedittp be antigenic. 
^ The homologous ; polyp^tidet©riderivatirce«ser^ be. produced 

b as such or as a msion»polypeptide ^ hi d ii s^ la Uei*c as eit a n d »if t h e y a n l is e r u u Kis also raised against 
B 20 a fusion polypeptide? tw6»different#fosfcrn«system be 
il determined according to a number of memods^meludmg^Westemyblot^Towbin et al., Proc. 
Natl. Acad. Sci. USA (1979) 76:4350), dot blot, and ELISA, as described below. 

In a Western blot assay, the product to be screened, either as a purified preparation 
or a total E. coli extract, is submitted to SDS-Page electrophoresis as described by Laemmli 
25 (Nature (1970) 227:680). After transfer to a nitroceUulose membrane, the material is further 
incubated with the monospecific hyperimmune antiserum diluted in the range of dilutions from 
about 1:5 to about 1:5000, preferably from about 1:100 to about 1:500. Specific antigenicity 
is shown once-a«bandi<&i*esp<pud^^^ ™ 
the above ranges 

30 in an»EBJS^assay^thei»produM«o*be*scr«ened».is iprefeFably*used*as the ooating 

antigen. A purified preparation 4s preferted^al^ used. 
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Briefly, about 100 jtl of a preparation at about 10 fig protein/ml are distributed into wells of a 
96-well polycarbonate ELISA plate. The plate is incubated for 2 hours at 37°C then overnight 
at 4°C. The plate is washed with phosphate buffer saline (PBS) containing 0.05% Tween 20 
(PBS/Tween buffer). The wells are saturated with 250 fi\ PBS containing 1% bovine serum 
5 albumin (BSA) to prevent non-specific antibody binding. After 1 hour incubation at 37 °C, the 
plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween 
buffer containing 0.5% BSA. 100 pi of dilutions are added per well. The plate is incubated 
for 90rninutes at 37°C, washed and evaluated according to standard procedures. For 
example, a goat anti-rabbit peroxidase conjugate is added to the wells when specific antibodies 
10 were raised in rabbits. Incubation is carried out for 90 minutes at 37°C and the plate is 
washed. The reaction is developed with the appropriate substrate and the reaction is measured 
by colorimetry (absorbance measured spectrophotometrically). Under the above experimental 
conditions, a positive reaction is shown by O.D. values greater than a non immune control 
serum. 

In a dot blot assay, a purified product is preferred, although a whole cell extract 
can also be used. Briefly, a solution of the product at about 100 »g/wl is serially two-fold 
diluted in 50 mM Tris-HCl (pH 7.5). 100 (A of each dilution are applied to a nitrocellulose 
membrane 0.45 fim set in a 96-well dot blot apparatus (Biorad). The buffer is removed by 
applying vacuum to tig-system. Wells are washed by addition uf 50 mM Tri s-HCl-feH^3)= 
and the membrane is air-dried. The membrane is saturated in blocking buffer (50 mM Tris- 
HCl (pH 7.5) 0.15 M NaCl, 10 g/L skim milk) and incubated with an antiserum dilution from 
about 1:50 to about 1:5000, preferably about 1:500. The reaction is revealed according to 
standard procedures. For example, a goat anti-rabbit peroxidase conjugate is added to the 
wells when rabbit antibodies are used. Incubation is carried out 90 minutes at 37 °C and the 
25 blot is washed. The reaction is developed with the appropriate substrate and stopped. The 
reaction is measured visually by the appearance of a colored spot, e.g., by colorimetry. 
Under the above experimental conditions, a positive reaction is shown once a colored spot is 
associated with a dilution of at least about 1:5, preferably of at least about 1:500. 

Therapeutic or prophylactic efficacy of a polypeptide or derivative of the invention 
30 can be evaluated as described below. 
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According to a seventh aspect of the invention, there is provided (i) a composition 
of matter containing a polypeptide of the invention together with a diluent or carrier; in 
particular, (ii) a pharmaceutical composition containing a therapeutically or prophylactically 
effective amount of a polypeptide of the invention; (iii) a method for inducing an immune 
5 response, ag^inst^Chimydiam inssa -mammal**, by* ^^adininisteBing^to^thetemammal an 
immunogenically -effective amount of ^a polypeptide* of the inventions *elieit an immune 
response, e.g., a protective immune resppnse to Chlamydia; and-partieularly^(iv) a method for 
preventing and/or treating a Chlamydia {e.g., C. trachomatis^C. psittaciy C. pneumoniae, or 
C. pecorum) infection, by administering a prophylactic or therapeutic amount of a polypeptide 
10 of the invention to an individual in need. Additionally, the seventh aspect of the invention 
encompasses the use of a polypeptide of the invention in the preparation of a medicament for 
preventing and/or treating Chlamydia infection, 
i The immunogenic compositions of the invention can be administered by any 

I conventional route in use in the vaccine field, in particular to a mucosal (e.g., ocular, 
• is intranasal, pulmonaiy^oral^gast^ or via 

\ the parenteral (e.g., subcutaneous^intradermal^ intravenous ^or intraperitoneal) 

I route. The choice of the^administration^^ such as 

the adjuvant associatedwith^ * mueosaltadjuvant is used, the 

; intnmasaTw^^ aluminum-compounds 
is used, the parenteralwoute^ Inathe^lattemea^ or 

intramuscular route-is most preferred*,* Thl^hoi^e^^ the^nature of the 

vaccine agent. For example, a polypeptide of the invention fused to CTB or LTB will be best 
administered to a mucosal surface. 

A composition of the invention can contain one or several polypeptides or 
25 derivatives of the invention. It can also contain at least one additional Chlamydia antigen, or a 
subunit, fragment, homolog, mutant, or derivative thereof. 

For use in a composition of the invention, a polypeptide or derivative thereof can 
be formulated into or wi^Uposomesrpreferablyiineuttal^ot anionie^lipQgQmes, -microspheres, 
ISCOMS, or vinis4il^pai^ immune 
30 response. These *eompounds#areeread^^ see 
Liposomes: A Practical ApproaGh^wp^)*^ 
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Adjuvants other than liposomes and the like can also be used and are known in the 
art. A appropriate selection can conventionally be made by those skilled in the art, for 
example, from the list provided below. 

A<lministration can be achieved in a single dose or repeated as necessary at 
5 intervals as can be determined by one skilled in the art. For example, a priming dose can be 
followed by three booster doses at weekly or monthly intervals. An appropriate dose depends 
on various parameters including the recipient (e.g., adult or infant), the particular vaccine 
antigen, the route and frequency of administration, the presence/absence or type of adjuvant, 
and the desired effect (e.g., protection and/or treatment), as can be determined by one skilled 
10 in the art. In general, a vaccine antigen of the invention can be administered by a mucosal 
route in an amount from about 10 /ig to about 500 mg, preferably from about 1 mg to about 
200 mg. For the parenteral route of administration, the dose usually should not exceed about 
1 mg, preferably about 100 /xg. 

When used as vaccine agents, polynucleotides and polypeptides of the invention can 
be used sequentially as part of a multistep immunization process. For example, a mammal can 
be initially primed with a vaccine vector of the invention such as a pox virus, e.g., via the 
Ul parenteral route, and then boosted twice with the polypeptide encoded by the vaccine vector, 
L e.g., via the mucosal route. In another example, liposomes associated with a polypeptide or 
£ derivative ol the invention can also be used fui piiming, with boosting bcin g=eamed=oafc, 
33 20 mucosally using a soluble polypeptide or derivative of the invention in combination with a 
2 mucosal adjuvant (e.g., LT). 

A polypeptide derivative of the invention is also useful as a diagnostic reagent for 
detecting the presence of anti-Chlamydia antibodies, e.g., in a blood sample. Such 
polypeptides are about 5 to about 80, preferably about 10 to about 50 amino acids in length 
25 and can be labeled or unlabeled, depending upon the diagnostic method. Diagnostic methods 
involving such a reagent are described below. 

Upon expression of a DNA molecule of the invention, a polypeptide or polypeptide 
derivative is produced and can be purified using known laboratory techniques. For example, 
the polypeptide or polypeptide derivative can be produced as a fusion protein containing a 
30 fused tail that facilitates purification. The fusion product can be used to immunize a small 
mammal, e.g., a mouse or a rabbit, in order to raise antibodies against the polypeptide or 
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polypeptide derivative (monospecific antibodies). The eighth aspect of the invention thus 
provides a monospecific antibody that binds to a polypeptide or polypeptide derivative of the 
invention. 

By "monospecific antibody" is meant an antibody that is capable of reacting with a 
5 unique naturally*-oeeul^ An anfibody^bffthe invention can be 

polyclonal or monoclonal-. Monospecific antibodies can be recombinant, e.g., chimeric (e.g., 
constituted by a variable region of murine origin associated with a human constant region), 
humanized (a human immunoglobulin constant backbone-together-^withiihypeFvariable region of 
animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific 
10 antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab)'2 or Fab 
fragments. The antibodies of the invention can be of any isotype, e.g. , IgG or IgA, and 
polyclonal antibodies can be of a single isotype or can contain a mixture of isotypes. 
m The antibodies of the invention, which are raised to a polypeptide or polypeptide 

u derivative of the invention, can be produced and identified using standard immunological 
Ri5 assays, e.g., We*tempblot»analy,s^ et aL > 

O Current Protocols m^Immunota^ Ndw**York, NY). The 

ffi antibodies can be »used4n diagnostic methods -to deteetetheapreseneeaofa Chlamydia, antigen in 
L a sample, suehwaswa ^biologieaKsample.** m^antibo^ies«.can«also*9pe*»used in affinity 

chromatography- methods^foi^-pui-i^ing^a polypeptid e ^! polypeptide derivative of t he_ 
jy 20 invention. As»4s*dis<ms»^*fu^^ 
m therapeutic passiy.e*immunizationimethods«i»t. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for detecting the 
presence of Chlamydia in a biological sample that contains an antibody, polypeptide, or 
polypeptide derivative of the invention; and (ii) a diagnostic method for detecting the presence 
25 of Chlamydia in a biological sample, by contacting the biological sample with an antibody, a 
polypeptide, or a polypeptide derivative of the invention, such that an immune complex is 
formed, and by detecting such complex to indicate the presence of Chlamydia in the sample or 
the organism fromwwhieh<ithe«sample»is»deBived.««i» 
Those*skdUe*d«m*me»art*w^^ 
30 a component-of«me*saiHple*and*^ -derivative, 
whichever is used)»^d«giat»an^^ me 
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complex. As can be easily understood, a polypeptide reagent is useful for detecting the 
presence of ^-Chlamydia antibodies in a sample, e.g., a blood sample, while an antibody of 
the invention can be used for screening a sample, such as a gastric extract or biopsy, for the 
presence of Chlamydia polypeptides. 
5 For use in diagnostic applications, the reagent (i.e., the antibody, polypeptide, or 

polypeptide derivative of the invention) can be in a free state or immobilized on a solid 
support, such as a tube, a bead, or any other conventional support used in the field. 
Immobilization can be achieved using direct or indirect means. Direct means include passive 
adsorption (non-covalent binding) or covalent binding between the support and the reagent. 
10 By "indirect means" is meant that an anti-reagent compound that interacts with a reagent is 
first attached to the solid support. For example, if a polypeptide reagent is used, an antibody 
that binds to it can serve as an anti-reagent, provided that it binds to an epitope that is not 
£fl involved in the recognition of antibodies in biological samples. Indirect means can also 
fcf employ a ligand-receptor system, for example, a molecule such as a vitamin can be grafted 
S 15 onto the polypeptide reagent and the corresponding receptor can be immobilized on the solid 
P. phase. This is illustrated by the biotin-streptavidin system. Alternatively, indirect means can 
S be used, e.g., by adding to the reagent a peptide tail, chemically or by genetic engineering, 
L and immobilizing the grafted or fused product by passive adsorption or covalent linkage of the 

1^ peptide tail. == 
5 jo According to a tenth aspect of the invention, there is provided a process for 

S purifying, from a biological sample, a polypeptide or polypeptide derivative of the invention, 
which involves carrying out antibody-based affinity chromatography with the biological 
sample, wherein the antibody is a monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be polyclonal or 
25 monospecific, and preferably is of the IgG type. Purified IgGs can be prepared from an 
antiserum using standard methods (see, e.g., Coligan et al., supra). Conventional 
chromatography supports, as well as standard methods for grafting antibodies, are disclosed 
in, e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988). 

Briefly, a biological sample, such as an C. pneumoniae extract, preferably in a 
30 buffer solution, is applied to a chromatography material, preferably equilibrated with the 
buffer used to dilute the biological sample so that the polypeptide or polypeptide derivative of 
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the invention (i.e., the antigen) is allowed to adsorb onto the material. The chromatography 
material, such as a gel or a resin coupled to an antibody of the invention, can be in batch form 
or in a column. The unbound components are washed off and the antigen is then eluted with 
an appropriate elution buffer, such as a glycine buffer or a buffer containing a chaotropic 
5 agent, e.g., guanidine HC1, or high salt concentration (e.g., 3 M MgCla). Eluted fractions are 
recovered and the presence of the antigen is detected, e.g., by measuring the absorbance at 
280 nm. 

An antibody of the invention can be screened for therapeutic efficacy as described 
as follows. According to an eleventh aspect of the invention, there is provided (i) a 
10 composition of matter containing a monospecific antibody of the invention, together with a 
diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or 
prophylacticaUy effective amount of a monospecific antibody of the invention, and (iii) a 
m method for treating or preventing a Chlamydia (e.g., C. trachomatis, C. psittaci, C. 
2 pneumoniae or C. pecorum) infection, by adrninistering a therapeutic or prophylactic amount 
Hi5 of a monospe©ifie%anubody t ^ an individual in need. Additionally, the 

Q eleventh aspect of the invention- encompasses the use of a monospecific antibody of the 
m invention in the preparation of a medicament for treating or preventing Chlamydia infection. 
; ; To this end, the monospecific antibody cm be polyclonal or monoclonal, preferably 

of the IgA isotype (predominantly): fir-passive iuimunizationr^he antibody can— be= 
jlj 20 administered to a mucosal, surfaced .a mammal.-.e.g., the gastric mucosa, e.g., orally or 
£ intragastrically, advantageously., in the presence of a bicarbonate, .buffer.- Alternatively, 
systemic administration, not requiring a bicarbonate buffer, can be carried out. A 
monospecific antibody of the invention can be adnunistered as a single active component or as 
a mixture with at least one monospecific antibody specific for a different Chlamydia 
25 polypeptide. The amount of antibody and the particular regimen used can be readily 
determined by one skilled in the art. For example, daily administration of about 100 to 1,000 
mg of antibodies over one week, or three doses per day of about 100 to 1,000 mg of 
antibodies over two or three days, can be an effective regimens for most purposes. 

Therapeutic or prophylactifeefficacy, can be-evaluatedsusingestandaEd-methods in the 
30 art, e.g., by«measurmgcmdutetion?ofva mu©osal6*imnaane*response or induction, of protective 
and/or meraptotic*iirmum^ Those skUled m 
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the art will recognize that the C. pneumoniae strain of the model can be replaced with another 
Chlamydia strain. For example, the efficacy of DNA molecules and polypeptides from C. 
pneumoniae is preferably evaluated in a mouse model using an C. pneumoniae strain. 
Protection can be determined by comparing the degree of Chlamydia infection to that of a 
control group. Protection is shown when infection is reduced by comparison to the control 
group. Such an evaluation can be made for polynucleotides, vaccine vectors, polypeptides and 
derivatives thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions described above are as follows. 
Adjuvants for parenteral adrninistration include aluminum compounds, such as 
aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen 
can be precipitated with, or adsorbed onto, the aluminum compound according to standard 
protocols. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can be used in 
parenteral administration. 

Adjuvants for mucosal administration include bacterial toxins, e.g., the cholera 
toxin (CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A and the 
pertussis toxin (PT), or combinations, subunits, toxoids, or mutants thereof . For example, a 
purified preparation of native cholera toxin subunit B (CTB) can be of use. Fragments, 
homologs, derivatives, and fusions to any of these toxins are also suitable, provided that they 

educed toxicity is used. Suitable- 
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retain adjuvant activity. Pfefei 
mutants are described, e.g., in WO 95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192- 
Gly LT mutant), and WO 95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT 
mutants that can be used in the methods and compositions of the invention include, e.g., Ser- 
63-Lys, Ala-69-Gly, Glu-110-Asp, and Glu-112-Asp mutants. Other adjuvants, such as a 
bacterial monophosphoryl lipid A (MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella 
typhimurium, or Shigella flexneri; saponins, or polylactide glycolide (PLGA) microspheres, 
can also be used in mucosal administration. 

Adjuvants useful for both mucosal and parenteral administrations include 
polyphosphazene (WO 95/2415), DC-chol (3 b-(N-(N\N' -dimethyl aminomethane)- 
carbamoyl) cholesterol; U.S. Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 
88/9336). 
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Any pharmaceutical composition of the invention, containing a polynucleotide, a 
polypeptide, a polypeptide derivative, or an antibody of the invention, can be manufactured in 
a conventional manner. In particular, it can be formulated with a pharmaceutically acceptable 
diluent or carrier, e.g., water or a saline solution such as phosphate buffer saline. In general, 

5 a diluent or carrder*<*anwbe*selee^ 311(1 
standard "pharmaceutdeal^praetice. Suitable phaimaceutieal caprders^ well as 

pharmaceutical .necessities for their use- in pharmaceutical formulationstte are^deserdbed in 
Remington's Phtirmaeeutical^Scienees, a standard; reference- te^^ the 
USP/NF. 

10 The invention also includes methods in which Chlamydia infection, are treated by 

oral administration of a Chlamydia polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a combination thereof. Examples of 
such compounds that can be administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and derivatives thereof (specific 
15 examples of antibiotics* ° r 
immunomodulators such^as cytokines or steroids - In addition, compounds*containing more 
than one of the*aboye-list^*cpmp^ents»co^ also 
includes campQsitioiig^for^ a 
~CHt amydia antigen*(or^antigens)^ 
listed compQunds, in»a*phafcma 

Amounts ot the^aboye-listed^compoundsiuse^ in the^methods^andi^mpositions of 
the invention can readily be determined by one skilled in the art. In addition, one skilled in 
the art can readily design treatment/immunization schedules. For example, the non-vaccine 
components can be administered on days 1-14, and the vaccine antigen 4- adjuvant can be 
25 administered on days 7, 14, 21, and 28. 
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Figure 1 CPN100626 Coding sequence 



m 



tcctgaactc 
tgtgacctac 
cactctct cc 
cgtggcgagt 
tctagtgtca 
gttcgggatg 
agagagggat 
aggattttct 
actttctaac 
tgtgtcctgt 
tctcgcgatt 
tagaggaccc 
aactcgtggg 
tttctacttt 
ctgccgcatc 
agggggtgcg 
taagaacaac 
aaataactcc 
aggaaatggt 
aactattctt 
tttgacaatc 
tgctcttatg 
atttcaaaat 
accaaatagc 
tatagaacac 
gggaacgatc 
cattagcagc 
tgcgggatgg 
ggcgagtatt 
catcattaat 
cttgtggatc 
tactttatca 
agatctctct 
tcatatcaat 



cactcgaaat 
aatgctttag 
ttagattact 
agaatgagat 
aagactcctc 
actcctgcag 
gaagagtttc 
ccaataacta 
tacaagtcta 
aataatttct 
gggactggag 
cttatttttt 
ggtgcgattg 
gtcaacaatt 
caaagcaaca 
cttcgtagtg 
tgtgggaaca 
gggtcggtga 
tcaggagggg 
ttcaataata 
aaaaatagtg 
ctcctacagg 
aatgaggttt 
aacttacaac 
caacatccaa 
ttattttctt 
tcgaaaaata 
caattctata 
gcaacaactg 
aaccttgcga 
cgtcctctac 
ggtcctctga 
gagcctttac 
accgataact 
tggtctcuLl 
aacaccctct 
gaataccaag 
ctattaagaa 
caagggattg 
cgtatccaat 
atctccttag 
gtctcggctc 
tttgcaacat 
cgtcacatca 
ggctgttctt 
attgcaatac 
gtctctcaaa 
tcaaaattcc 
caacaaaatc 
ggccataact 
cgttctctcg 
catctccaag 
ttagaattaa 
atttaaataa 



tactgattag 

ggatcaaagt 

ctgcggatat 

ttttaacaat 

ctaagttttt 

tgtatagttt 

gtacgagctt 

cgtttgttgg 

ttgataacat 

tattatcaaa 

gcgcgattgc 

tcagcaatcg 

cctgtaatgg 

ccgtcaacaa 

gggcacctct 

aaaatacaac 

atggcggggc 

ttttcaataa 

cgatttatac 

actactgcat 

gccacgtata 

acagcacctg 

tcctcaccac 

ttggagctaa 

ctacaaatcc 

cagcctatat 

cctctgaact 

agttcactca 

ccaactctga 

ttaacctccc 

aatctagtgc 

cactcttaaa 

aaaacattca 

ttcatcctga 

aLLggg t a g a 

acagagctct 

gagatcttgc 

gttataatcg 

ccgacggcct 

ctacagggta 

gttttgcaca 

acaatacagt 

cccacagttt 

agaacagggc 

tcccttggca 

gttctcacca 

agcctttcta 

acgtacctac 

cccaaatcgg 

atgttcgcaa 

atctattctt 

caggaagtac 

caactatccg 



ccaaggtacg 

gaaaaatacc 

ttcttcctcc 

aagtgaccaa 

attctatctc 

acaaacggac 

tcctctctta 

aaatagacat 

ccttcttctt 

tgttgaagac 

ttgccaggga 

aggtcttaac 

agacttcacg 

ctggggagga 

actctttttt 

gatctctgat 

cattcaaaca 

caacacagcg 

aacaaaccta 

tcgcgatggc 

tttcaccaac 

cctactcttc 

atttggtaga 

taaggggtat 

tctaatcttt 

cccagaagct 

tcgcaatggt 

aaaaggaggt 

gactccatca 

ctcgatctta 

tcctttcaca 

tgaggaaaac 

tcttctttct 

aagcttaaat 

gac g at -aaca- 

gtatgccaat 

tacgactccc 

aactggtgat 

ctttgttcat 

ttccttacaa 

gttcttcacc 

ctcttcactt 

agcgtatggc 

agaagggacg 

acagaaatcc 

aacagcgttc 

taatctgacc 

agaatggact 

tgtcacgcta 

tgctttaggg 

ggattaccaa 

cttaaaattc 

atgagctacg 



tggacgacgc 
atgcaggtgt 
acgctgagtc 
aacagaaaga 
gggaatttca 
tcccttgaaa 
gactctctct 
aattcctctc 
tggacatcgg 
catgccttct 
gcctgcacaa 
aatgcgagta 
atttctcaaa 
gccctctcca 
aacaatacag 
aacacgcgtc 
agcgttactg 
ttatctggtt 
tccatagacg 
ggagctatct 
aatcaaggaa 
gcggaacaag 
tacaacgcca 
acgactgctt 
aatcccaatg 
tctgactacg 
gtcctctcta 
atccttaaat 
actagtgtag 
gcaaaaggaa 
gaggacaata 
cgcgatccct 
ttatcggatg 
gcgactgagc 
acaacaaata 



aggccactcc 
ttcctaaagt 
actacttaaa 
ttaaggaacc 
cagcctgcat 
agtttgcttt 
ccactcttac 
aagacattgt 
ctgggggagc 
tcagtaaaaa 
tcacgaagaa 
caggaggaga 
atcaagggac 
ccaatggaca 
cccctagtgg 
ctatttattt 
ttgcgataaa 
cgataaattc 
ataaccctgg 
gtacacaatt 
actggggagg 
gaaatatcgc 
tacattgtac 
tttttgatcc 
cgaaccatca 
aaaataattt 
tcgaggatcg 
tagggcatgc 
getcccaggt- 
aagctcctac 
accctacaat 
acgacagtat 
taacagcacg 
attacggtta 
acg ct Lctat 
taggatataa 
cctttcatac 
agaggccttt 
tccccggggc 
aaacttcttt 
aaatcggatc 
ttccgtggtt 
atcacctcca 
atacattagc 
tcagcccgtt 
gtgacaatcc 
gaatccaagg 
cttaccaacc 
gaggttcctg 
acaatcaaac 
cctcctcgac 
gaacgataaa 
cggtagagga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

17 4 0 

1800 

1860 

1920 

1980 

2040 

2100- 



tcaaggcatc 
agagacggca 
ggtcaatcct 
tatgttctct 
cttagaaatt 
tccaggattc 
acatcagaaa 
aagcaacaac 
ccaagaggcc 
cgcctacatc 
agcagctatc 
cgttcaggca 
ccgaaagttt 
aaaatggcag 
ggtactctat 
ggatatccta 
tgcgctcttc 
atctacgcac 
attgaaatct 

ctccctcaaa 



tggactccct 
ctatggcaat 
tctgatatcg 
caaaatagca 
gcatcctccg 
cgcactaaag 
tatgttgagc 
tatggggacc 
tgttatagcc 
tatcttcacc 
gaagagattg 
ttacctctag 
ctagaacttt 
cttgcgagcg 
tacaaagtcc 
ggatcggtct 
taaaataaaa 
ttagcccaat 



2160 
2220 
2280 
234 0 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3200 



33 



Figure 2 CPN 100626 Deduced Protein sequence 



Met Gin Val Phe Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp 
15 10 15 

3 1e Ser Ser Ser Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met 
20 25 30 

Arg Phe Leu Thr* lie Se***Asp^Gln Asiv*Arg^Lys*-He Lys^Gluv Pr'6*-Leu^ 
35 40 45 

Val Ser Lys Thr Pro Pro Lys Phe Leu Phe Tyr Leu Gly Asn Phe Thr 
50 ' 55 60 . 

Ala Cys Met Phe^Gly Met Thr Pro Ala Val Tyr Ser Leu Gin Thr Asp- 

65 70 75 80 

Ser Leu Glu Lys Phe Ala Leu Glu Arg Asp Glu Glu Phe Arg Thr Ser 
85 90 95 

Phe Pro Leu Leu Asp Ser Leu Ser Thr Leu Thr Gly Phe Ser Pro He 
100 105 HO 

Thr Thr Phe Val Gly Asn Arg His Asn Ser Ser Gin Asp He Val Leu 
115 120 125 

Ser Asn Tyr Lys Ser He Asp Asn He Leu Leu Leu Trp Thr Ser Ala 
130 135 140 

Gly Gly Ala Val- Ser-< Cys*, Asn. Asn<*Phe**Leu^Leu*Se^ 

145 150 155V. 160 

His Ala Phe Phe Ser . Lysv,Asn^Leu- f Ala He Gly. Thr- Gly Gly ,-Al-a, lie 
165 - 170 175: 

Ala Cys Gin Gly. Ala^Gys^Thr^Ile ThraLys^Asn^ 

rau— -195 Mfl = 



Phe Phe Ser Asn-Arg-Gl^Leu^Asn^Asn^Ala^Ser^Thr >Gly. Gly v; Glu Thr^- 
195 * 200^ 205 *• 

Arg Gly Gly Ala He Ala Cys Asn Gly Asp Phe Thr* He Ser Gin Asn 
210 215 220 

Gin Gly Thr Phe Tyr Phe Val Asn Asn Ser Val Asn Asn Trp Gly Gly 
225 230 235 240 

Ala Leu Ser Thr Asn Gly His Cys Arg He Gin Ser Asn Arg Ala Pro 
245 250 255 

Leu Leu Phe Phe Asn Asn Thr Ala Pro Ser Gly Gly Gly Ala Leu Arg 
260 265 270 

Ser Glu Asn Thx Thr He Ser Asp Asn Thr Arg Pro He Tyr Phe Lys 
275 280 285 

Asn Asn Cys Gly Asn ;Asn* Gly^Giy-Afca^*^ 

290 295'-*- 300^ 

Ala lie Lys Asn^AswSer^Gl^ -Ser^Val^ 

305 310 v a: 3151* 320«e 

Leu Ser Gly Ser He Asn Ser Gly Asn Gly Ser Gly Gly Ala He Tyr 

34 



325 



330 



335 



Thr Thr Asn Leu Ser lie Asp Asp Asn Pro Gly Thr lie Leu Phe Asn 
340 345 350 

Asn Asn Tyr Cys He Arg Asp Gly Gly Ala He Cys Thr Gin Phe Leu 
355 360 365 

Thr He Lys Asn Ser Gly His Val Tyr Phe Thr Asn Asn Gin Gly Asn 
370 375 380 

Trp Gly Gly Ala Leu Met Leu Leu Gin Asp Ser Thr Cys Leu Leu Phe 
385 390 395 400 

Ala Glu Gin Gly Asn He Ala Phe Gin Asn Asn Glu Val Phe Leu Thr 
405 410 415 

Thr Phe Gly Arg Tyr Asn Ala He His Cys Thr Pro Asn Ser Asn Leu 
420 425 430 

Gin Leu Gly Ala Asn Lys Gly Tyr Thr Thr Ala Phe Phe Asp Pro He 
435 440 445 

Glu His Gin His Pro Thr Thr Asn Pro Leu He Phe Asn Pro Asn Ala 
450 455 460 

Asn His Gin Gly Thr He Leu Phe Ser Ser Ala Tyr He Pro Glu Ala 
465 470 475 480 

Ser Asp Tyr Glu Asn Asn Phe He Ser Ser Ser Lys Asn Thr Ser Glu 
485 490 495 

Leu Arq Asn Gly Val Leu Ser He Glu Asp Arg Ala Gly Trp Gin Phe 
500 505 510 

Tvr Lys Phe Thr Gin Lys Gly Gly He Leu Lys Leu Gly His Ala Ala 
515 520 525 



Ser He Ala Thr Thr Ala Asn Ser Glu Thr Pro Ser Thr Ser Val Gly 
530 535 540 

Ser Gin Val He He Asn Asn Leu Ala He Asn Leu Pro Ser He Leu 
545 550 555 560 

Ala Lvs Gly Lys Ala Pro Thr Leu Trp He Arg Pro Leu Gin Ser Ser 
565 570 575 

Ala Pro Phe Thr Glu Asp Asn Asn Pro Thr He Thr Leu Ser Gly Pro 
580 585 590 

Leu Thr Leu Leu Asn Glu Glu Asn Arg Asp Pro Tyr Asp Ser He Asp 
595 600 605 

Leu Ser Glu Pro Leu Gin Asn He His Leu Leu Ser Leu Ser Asp Val 
610 615 620 

Thr Ala Arg His He Asn Thr Asp Asn Phe His Pro Glu Ser Leu Asn 
6 25 630 635 640 

Ala Thr Glu His Tyr Gly Tyr Gin Gly He Trp Ser Pro Tyr Trp Val 
645 650 655 

Glu Thr He Thr Thr Thr Asn Asn Ala Ser He Glu Thr Ala Asn Thr 
660 665 670 

35 
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Leu Tyr Arg Ala Leu Tyr Ala Asn Trp Thr Pro Leu Gly Tyr Lys Val 
675 680 685 

Asn Pro Glu Tyr Gin Gly Asp Leu Ala Thr Thr Pro Leu Trp Gin Ser 
690 695 700 

Phe His Thr Met Phe Ser Leu Leu Arg Ser Tyr Asn Arg Thr Gly Asp 
705 710 715 720 

Ser Asp lie Glu Arg Pro Phe Leu Glu lie Gin Gly lie Ala Asp Gly, 
725 730 735 

Leu Phe Val His Gin Asn Ser lie Pro Gly Ala Pro Gly Phe Arg He* 

740 745 750 

Gin Ser Thr Gly Tyr Ser Leu Gin Ala Ser Ser Glu Thr Ser Leu His 
755 760 765 

Gin Lys He Ser Leu Gly Phe Ala Gin Phe Phe Thr Arg Thr Lys Glu 
770 775 780 

lie Glv Ser Ser Asn Asn Val Ser Ala His Asn Thr Val Ser Ser Leu 
785 790 795 800 

Tyr Val Glu Leu Pro Trp Phe Gin Glu Ala Phe Ala Thr Ser His Ser 
805 810 815 

Leu Ala Tyr Gly^Tyr Gly Asp His His Leu His Ala Tyr He Arg His 
$20m 82*S'r^ 830*# 

He Lys Asn Arg, Al*a ,Glu Gly Thr- Cys Tyr Ser- His Thr .Leu^Ala Ala» 
835 840 845 

Ala He Gly Cys wSe1^*Ph*e^"Pro- Trp%Glh. Gin <Lys Ser^Tyr^Leu^His Leu- 
850 855.** 8 6 ©r 

Ser Pro Phe VdTTGln^Al-a— rie Mttt-Ho Arg Ser^Hifs Glir-T^r-Al-a---Phe 
865 870" 875 - 880*-. 

Glu Glu He Gly^AspjfcAsn^Pro^Arg^ Lysj^Pro^Phew* 
8 85** 8 90^ 8 95r^ 



Tyr Asn Leu Thr Leu Pro Leu Gly He Gin Gly Lys Trp Gin Ser Lys 
900 905 910 

Phe His Val Pro Thr Glu Trp Thr Leu Glu Leu Ser Tyr Gin Pro Val 
915 920 925 

Leu Tyr Gin Gin Asn Pro Gin He Gly Val Thr Leu Leu Ala Ser Gly 
930 535 940 

Glv Ser Trp Asp He Leu Gly His Asn Tyr Val Arg Asn Ala Leu Gly 
945 950 955 960 

Tvr Lvs Val His Asn. Gin Thr Ala Leu Phe Arg Ser Leu Asp Leu Phe 
965^5? 9T0m 97'5«* 

Leu Asp Tyr GlnwG:fcy«Se%^V^ 

9803C 985*- 990X 

Gin Ala Gly Se^Th'f^fceu^Isys^ 

995 1000^' 
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Figure 3 

Restriction enzyme analysis of CPN100626 



MslI 
Cjel 
Hgal 



Hpyl78III 



Tsp509I 
TaqI 1 Cjel 
I I 



BsaJI 
Styl 
Cvi JI I 
I I 



BsaAI 
Maell) 
Rsal I I 
I I 1 



CviJI | 

Hael | 

Haelll I 

Cac8I I I 

I I 1 



TCCTGAACTCCACTCGAAATTACTGATTAGCCAAGGTACGTGGACGACGCAGGCCACTCC 

+ + + + + + 60 

AGGACTTGAGGTGAGCTTTAATGACTAATCGGTTCCATGCACCTGCTGCGTCCGGTGAGG 



Maelll 
Tsp4 5I 



61 



O 



Aarl 
MslI I 

Dpnl BspMI CviRII I Maelll 

Sau3AI | Alwl I Nlallll I Tsp45I 

I I I I Ml t 

TGTGACCTACAATGCTTTAGGGATCAAAGTGAAAAATACCATGCAGGTGTTTCCTAAAGT 

ACACTGGATGTTACGAAATCCCTAGTTTCACTTTTTATGGTACGTCCACAAAGGATTTCA 



120 



Ddel 



BseMIX 
Bsp24I I 
MboII CjePI I 
Acill Cjel | I 
It I I 



Maelll 
Tsp45I 
Hinf I ! 
Mnll I 
Ddel I I 
I I I 



Mae 1 1 
Msel I 
Plel I I 
I I f 



121 



CACTCTCTCCTTAGATTACTCTGCGGATATTTCTTCCTCCACGCTGAGTCACTACTTAAA 

.4.—— — — — — — +— — — — — — — — — -i —+«.— —— — — — — — __— — + 180 

GTGAGAGAGGAATCTAATGAGACGCCTATAAAGAAGGAGGTGCGACTCAGTGATGAATTT 



Cjel 



CjePI ^flroTT^ 

Bsp24I| Msel Tsp45I 

II t I 

CGTGGCGAGTAGAATGAGATTTT TAACAAT AAGT GACCAAAACAGAAAG AT TAAGG AACC 

181 + + + + + -+ 240 

GCACCGCTCATCTTACTCTAAAAATTGTTATTCACTGGTTTTGTCTTTCTAATTCCTTGG 



Msel NlalV 
I I 



Hinfl 
Mnll 
Plel I 
BseRI 1 I 
Bfal I I I 
I III 



241 



Apol CviRI 

Tsp509I Cac8I | 

Sthl32I Hpyl78III I Sthl32I | 

Ddel Mnll I Aval I I CviJI I I 

, . . I II III I I I 

TCTAGTGTCAAAGACTCCTCCTAAGTTTTTATTCTATCTCGGGAATTTCACAGCCTGCAT ^ 



AGA' 



TCACAGTTTCTGAGGAGGATTCAAAAATAAGATAGAGCCCTTAAAGTGTCGGACGTA 



BtsI 

Hpyl78III Fokl 
Plel I PstI 
Nlalll I I CviRI I 

Nspl | I Hinfl Sfcl I ITspRI Plel Hinfl 
III I III I I I 

GTTCGGGATGACTCCTGCAGTGTATAGTTTACAAACGGACTCCCTTGAAAAGTTTGCTTT 
301 CAAGCCC TACTGAGGACG T CACATAT CAAATGT TTGCCTGAGGGAACTT T TCAAACGAAA 



Mnll 
I 



+ 360 
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V 



Hin4I 



BplI 



Earl 
I 



XmnI 
I 



Alul 
CviJI 
MboII I 
Rsall I 
Foki | | I 
SunI | I I 
I I I I 



BsaXI 
Hin4I 
Mnll 
Hinfll 
Ddel [ | 
Plel | I I 
I I II 



361 +~ + + +— — + + 420 

TC T C T CCCT ACT T CTCAAAGGAT G CTOGAAAGG AGA©AA<n<ST GAGAGAGAGGT^GAGA^RG 

Hpy4a8III Rsa»I~ 
C jei Tsp50-9*I^ SmM | Tat-I I 

Mme*** I Mael-I Bee 833b;.. ICjeis^ | lMn-1-I^ I I 

| | I I 1 I I I I I I 

AGGATTTTCTCCAATAACTACGTTTGTTGGAAATAGACATAATTCCTCTCAAGACATTGT 

421 + + + + + + 480 

TCCTAAAAGAGGTTATTGATGCAAACAACCTTTATCTGTATTAAGGAGAGTTCTGTAACA 



Foki 



MboII 



481 



Bsp24I Alul 
Cjei CviJI CviJI 

CjePI Acelll I Mwol I 

I II I I I I 

ACTTTCTAACTACAAGTCTATTGATAACATCCTTCTTCTTTGGACATCGGCTGGGGGAGC 

+ + + 540 

TGAAAGATTGATGTTCAGATAACTATTGTAGGAAGAAGAAACCTGTAGCCGACCCCCTCG 



Tsp5,09l 
Cj-ei*** I 
CjeB*t I 

Bsp2<ir 1 1 I 
I ) i I 



MboU 
Nl'^WfiKK*' I 
Bbs?I | I 
Eco57-P MboXI I 1 I 
I III! 



CjePI 
I 



541 acac&ggag&tWt*^^ 



Mnll 
Hpyl78lII I 

CviRI I I 

Cac8I I I I 

CviJI || II 

NlalVI II II 

Bpml I I I I II 

ScrFI I lilt II 

BsaJI II I I I I II 

Ec^MWIll MM II 

Caci^l Ml I I I I II 

C j:eB*»l| HI I I I I II 

BstyMt I I I 11 II I I II 

Hpyl 7 8*1*1*1" Bs»2^I^BsmF/I<^ I I I IN* Mil I I 

nj?u»iw Ttm»M^imhm^i 1 i 1 1 w Mil I I 

Thai Mnll * I I Thai I I I I I I I I I I I II 

| t II I I I I I 1 M I I I I II 
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601 



TCTCGCGATTGGGACTGGAGGCGCGATTGCTTGCCAGGGAGCCTGCACAATCACGAAGAA 

+ + + + + + 660 

AGAGCGCTAACCCTGACCTCCGCGCTAACGAACGGTCCCTCGGACGTGTTAGTGCTTCTT 



MboII 
NlalV 
Avail 
ECOO109I 
PspSII 
Sau96I 
SimI 



Mnll TaqI 



Msel 



Rsal 
Mnll | 
TatI | 



TAGAGGACCCCTTATTTTTTTCAGCAATCGAGGTCTTAACAATGCGAGTACAGGAGGAGA 



661 



+ 720 



ATCTCCTGGGGAATAAAAAAAGTCGTTAGCTCCAGAATTGTTACGCTCATGTCCTCCTCT 

BssSX BseRI BsmAI Hpyl78lII 

II I 1 

AACTCGTGGGGGTGCGATTGCCTGTAATGGAGACTTCACGATTTCTCAAAATCAAGGGAC 

721 + + + + + + 780 

TTGAGCACCCCCACGCTAACGGACATTACCTCTGAAGTGCTAAAGAGTTTTAGTTCCCTG 

Banll 
Bspl286I 

BsrI 
Biarl I CviJI 
Mnll I |Hin4I| 
Hindi I I !NlaIV| 



Tsp50 9I 
Hindi I 
BsmFI I I 



Fokl 
Mnll 
BseRI I BplI 



ill I III M 

TTTCTACTTTGTCAACAATTCCGTCAACAACTGGGGAGGAGCCCTCTCCACCAATGGACA 



781 



+ 840 



AAAGATGAAACAGTTGTTAAGGCAGTTGTTGACCCCTCCTCGGGAGAGGTGGTTACCTGT 



IJ1 



m 



Bspl286I BslI 
NlalVI EcoNI 
Bmgl | | Bf al I 

BseSIl j Msel Mnll I I 

BanI | | | Mnll I CviJI | | | 

I I I I II I II I 

CTGCCGCATCCAAAGCAACAGGGCACCTCTACTCTTTTTTAACAATACAGCCCCTAGTGG 

+ + + + + + 

GACGGCGTAGGTTTCGTTGTCCCGTGGAGATGAGAAAAAATTGTTATGTCGGGGATCACC 



TspRI 
Acil I 
Fnu4HI | 
Taul I 
BtsI I I 
I I 



841 



SfaNI 
Mwol I 
1 I 



900 



Pflll08I 
Hhal I 



Hpyl88IX 
Hgall 
Dpnl I I 
Sau3AI I I I 
II II 



Thai 
AflHI I 
Mlul I 
I I 



CjePI 
I 



Msel 
I 



AGGGGGTGCGCTTCGTAGTGAAAATACAACGATCTCTGATAACACGCGTCCTATTTATTT 



901 



+ 960 



TCCCCCACGCGAAGCATCACTTTTATGTTGCTAGAGACTATTGTGCGCAGGATAAATAAA 



Faul 
Sthl32I| 



CviJI 
Haelll 
CjePI i 
NlalV | 
Sau96I | | 



Tthlllll 
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RleAI Taal I I Acil I I I Maelll Taal | 

I IN I I I I i I I 

TAAGAACAACTGTGGGAACAATGGCGGGGCCATTCAAACAAGCGTTACTGTTGCGATAAA 

961 + + + + + + 1020 

ATTCTTGTTGACACCCTTGTTACCGCCCCGGTAAGTTTGTTCGCAATGACAACGCTATTT 

Siml Apol 
Neil | Tsp509I 
ScrEI I Taql^. I 

Sthl32I MspOM I Hph-P Bsbl * Drdl^I I I 

i Ml I I 111 

AAAT AAC T CC GGGTCGG T GAT«T TTCAAT AAG AACAC^GSGTT ATCTGGT*TGGA 

1021 + + — + + +~ + 1080 

TT TAT TG AGGGCGAGCGAC T AAAftGT T ATTGTTGTGTCGCAATAGACCAAG <STAT*TT AAG - 

Hpyl78III 

Drdll I ScrFI 

Mnlll I BsaJI I 

Hpyl78III Xmnlll I EcoRII | 

I I I I I II 

AGGAAATGGT TCAGGAGG GGCGATT TAT ACAACAAACC TATCCAT AGACGAT AAC CCTGG 

1081 + + + + + + 1140 

TCCTTTACCAAGTCCTCCCCGCTAAATATGTTGTTTGGATAGGTATCTGCTATTGGGACC 



Alul 
CviJI 
Ecil. 
Acil | 



I I I 

I I I 

I I I 

t I I 



Hpyl78III 
Bsra-I^.Nru-I. 
CviRI ■ Thai- 
I I 



Rsa-I* 
Bsi?G ( I>v.| 
Tat-I I 
I I 



I 



AACTATTCTTTTCAATAATAACTACTGCATTCGCGATGGCGGAGCTATCTGTACACAATT 

1141 + + + + + + 1200 

TTGATAAGAAAAGTTATTATTGATGACGTAAGCGCTACCGCCTCGATAGACATGTGTTAA 

BsaAI 
HphI 
Maell 
CviJI 
Hael 

Haelll | | BsrI 
MscI | I Bmrl I 

Eael I I \ Mnll I I 

I I I I III 
T T TGACAATCAAAAAT AGTGGGC ACGT ATAT T TC ACCAACAATCAAGGAAACT GGGGAGG 

AMCTGT TAGTTTTT*£T©#G^ 



BsiHKAI 
Bspl286I 



Mb©I-I . 
Aar l I 



Aci-I 
EarrI 



I I II I I 

TGCTCTTATGCTCCTACAGGACAGCACCTGCCTACTCTTCGCGGAACAAGGAAATATCGC 

40 



RY45 101998 
+ 1320 



1261 + + + 

ACGAGAATACGAGGATGTCCTGTCGTGGACGGATGAGAAGCGCCTTGTTCCTTTATAGCG 

Rsal 

Cjel BsrGI | 

Cjel | Mnll Cjel I I 

Mnll HphI I HgiEIII Cjel Cjel | TatI I 

| || II I 1111 

ATT TCAAAATAATGAG GTTTTCCTCACCACAT T T GGTAGATACAACGCCAT ACAT TGTAC 

1321 + + + + + + 1380 

TAAAGTTTTATTACTCCAAAAGGAGTGGTGTAAACCATCTATGTTGCGGTATGTAACATG 

Fokl 
Sfcl 
Dpnl I 

Alul BstZl7I Sau3AI I I 

Ciel CviJI Accll Alwl I 1 I 

I I II 1111 

ACCAAATAGCAACTTACAACTTGGAGCTAATAAGGGGTATACGACTGCTTTTTTTGATCC 

1381 + + + + + + 1440 

TGGTTTATCGTTGAATGTTGAACCTCGATTATTCCCCATATGCTGACGAAAAAAACTAGG 



1441 



Mmel 

Mnll I Bed 
Ms el I Drdll I 

II It 
TATAGAACACCAACATCCAACTACAAATCCTCTAATCTTTAATCCCAATGCGAACCATCA ^ 

ATATCTTGTGGTTGTAGGTTGATGTTTAGGAGATTAGAAATTAGGGTTACGCTTGGTAGT 

Pflll08l 



5; 


MboII 
Dpnl 1 




Hpyl88IX 
Alul 1 
CviJI |= 




! — 


5auJ7nr~l 1 
Eco57I J I 1 
1 1 It 


CviJI 
1 


Hindlll I 1 
1 1 1 


ITsp509I 
1 1 



1501 



GGGAACGATCTTATTTTCTTCAGCCTATATCCCAGAAGCTTCTGACTACGAAAATAATTT 

, _l + + + + 1560 



CCCT 



T GCT AGAAT AAAAGAAGTCGGATATAGGGT CT TCGAAGACTGATGC T TT T AT TAAA 



TaqI 
Alul 
CviJI 
Fnu4HI | 
Tsell I 
1 I 



1561 



Dpnl 
Faul 
Sau3AI 

Hpyl88IX Sthl32Il 
Acelll I Bcgl Mnll I I 

Bbvll 1 Mnll BsrDIt Mnll TaqI III 

M I I II I I I 1 I 

CATTAGCAGCTCGAAAAATACCTCTGAACTTCGCAATGGTGTCCTCTCTATCGAGGATCG 

. + + + + + 1620 

GTAATCGTCGAGCTTTTTATGGAGACTTGAAGCGTTACCACAGGAGAGATAGCTCCTAGC 

Fnu4HI 
Taul 
Acil I 
Nlallll 
BciVI Nspl 1 

Alwl Tsp509I Fokl Tsp509I I SphI | 

Acil I Bed IBcgl I Mnll Msel I I Cac8I I I 

II || 11 I til IN 

TGCGGGATGGCAATTCTATAAGTTCACTCAAAAAGGAGGTATCCTTAAATTAGGGCATGC 
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1621 + + + + + + 1680 

ACGCCCTACCGTTAAGATATTCAAGTGAGTTTTTCCTCCATAGGAATTTAATCCCGTACG 



Bed 
Hin4I 
Hinfl 
BseMII Ddel 
BstAPIl Hpyl88IX 
CviRI- | | Plel**| 

Mwoftl I Mwo/JatfBsmAI I I 
II I 1 III 



BfaL* 
Spell. 
I I 



ScrFI 
BsaJI 
EcoRII 
NlaLV^ *| 
CVMII I 
I I I 



GGCG AGIAT TG€AA(SAA&Ts£GCAACTC T GAG ASTCC&TGAAG T AGTGT AGG€?TC€G AGGT 

1681 — + +— + +-- - ■+ 1740 

CCGGT CAT AAGGTJF GTmEG AGGGT T GAGACTG T GAGGTA f GTTGATCA©ATGCG AGGG3?CCA ^ 



Mnll 
Ddel 
Dpnl 
Mnll 

Msel Sau3AI | | | Alul 

Vspl Msel Taqll I I I CviJI 

I I II I 

CATCATTAATAACCTTGCGATTAACCTCCCCTCGATCTTAGCAAAAGGAAAAGCTCCTAC 

1741 + + + + + + 1800 

GTAGTAATTATTGGAACGCTAATTGGAGGGGAGCTAGAATCGTTTTCCTTTTCGAGGATG 



Alftl**; 

Hija-4'K' r 



Dprvl 
NlalV 
BamH-I J 
BsfcYI | 
Sau3AI- 



I I 



=1=1= 



Bsi-HICA-Bw* 
Bsp%2,86Tl^ 
Bf^ap^ I 



Tsp5^3S3^ 



I I I I I III I 

CTTG)£GGAa!<S<SGa?<S<5r^^^^ 

1801 — + ii.^- rrT . f; — . — — : + 1860 

GAA<&&G<&TXGGGAGGAGA^TGfoT!AGA 



Hpyl88IX 
Avail 
ECOO109I 
PspSII 
Sau96I 
Sse8647I 
I 



Msel 
Mnll | 
Mnll | | 

I I I 



Pflll08I 
Dpnl 
Sau3AI 
Thai I 
Acil | i 
Alwl I | I 
I I I I 



BseMII 
Taal | 
I I 



TACTTTATCAGGTCCTCTGACACTCTTAAATGAGGAAAACCGCGATCCCTACGACAGTAT 

1861 + + + + + + 1920 

ATGAAATAGTCCAGGAGACTGTGAGAATTTACTCCTTTTGGCGCTAGGGATGCTGTCATA 



CviJI 
Ddel 
Hpy/l"8 t 8«?X^ 
DprvSifc 
Bglll J| 
BstYI :| 
Sau3AI *\ 
I I 



1921 



Maea^rlC,. FoM 
Mb^Matf HpyA8:8a»X^ * I Ma v e*M 1 ■ 

I I I II 

AGAT6T<^T(^GAG<5,eTT-TA©A^ 

. + +- + — + + + 1980 
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TCTAGAGAGACTCGGAAATGTTTTGTAAGTAGAAGAAAGAAATAGCCTACATTGTCGTGC 



Fokl 
I 



BseMII 
Msel 
Alul 
CviJI 
Hindlll ! 
Hpyl78lII I I 
I I I 



Ddel 
I 



Taal 



TCATATCAATACCGATAACTTTCATCCTGAAAGCTTAAATGCGACTGAGCATTACGGTTA 



1981 



-+ 2040 



AGTATAGTTATGGCTATTGAAAGTAGGACTTTCGAATTTACGCTGACTCGTAATGCCAAT 



BsmAI 
Bsal I 
BsmAI I 
SfaNI (BsmBI 
I t I 



Sfcl 
BsmAI | 
BsmBI I 
I I 



2041 



2101 



T CAAGG C AT CTGGTCTCCT TAT TGG G T AG AGACGAT AACAACAAC AAAT AAC GC T T CT AT 

+ + + + + + 2100 

AGTTCCGTAGACCAGAGGAATAACCCATCTCTGCTATTGTTGTTGTTTATTGCGAAGATA 

Banll 
BsiHKAI 
Bspl286l 
CjePI 
Sad 
Alul | 
CviJI | 
Mnll | Muni 
Tthllllll I Plel 
Beef I I I I Tsp509I 
Sfcl I I i IMwoI I 
II Ml I I 
AGAGACGGCAAACACCCTCT ACAGAGCT CT GT AT GCCAAT TGGACTCCCT TAGGATAT AA 

+ + -+ + + + 2160 

TCTCTGCCGTTTGTGGGAGATGTCTCGAGACATACGGTTAACCTGAGGGAATCCTATATT 



Bsu36I 
Hinfl Ddel 



HaeXV 
Hin4I 
I 



Hpyl78III 
CjePI t 



BsaJI 
Styl 
I 



Dpnl 
Bglll 
BstYI 
Sau3AI 
I 
I 



Hinfl 
Pflll08I I 
Plel I I 
I I 



GGTCAATCCTGAATACCAAGGAGATCTTGCTACGACTCCCCTATGGCAATCCTTTCATAC 



2161 



2221 



+ 2220 

CCAGTTAGGACTTATGGTTCCTCTAGAACGATGCTGAGGGGATACCGTTAGGAAAGTATG 

CviJI 
Hael 
Haelll 
Hpyl78III 
TaqI 
HphI 
EcoRV 
Mnll 
Hpyl88IX 
Hinfl I 
Tfil I 

Msel TaqI BsrI | I Mil I StuI 

| I II I Mil I I 

TATGTTCTCTCTATTAAGAAGTTATAATCGAACTGGTGATTCTGATATCGAGAGGCCTTT 

. + + + + + 2280 
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ATACAAGAGAGATAATTCTTCAATATTAGCTTGACCACTAAGACTATAGCTCTCCGGAAA 



Sthl32I 
I 



Apol Bcefl 
Tsp509I CviJI Mnll I I 

Ddel | Haelll Fokl I I Bpml I 

I I I III II 

CTTAGAAATTCAAGGGATTGCCGACGGCCTCTTTGTTCATCAAAATAGCATCCCCGGGGC 

2281 + + + + + + 2340 

GAATCTTTAAGTTCCCTAACGGCTGCCGGAGAAACAAGTAGTTTTATCGTAGGGGCCCCG 



NlalV 
Sthl32I 
CviJI 
SfaNI 
Neil 
ScrFI 
Smal 
Bsa JI I 
Mspl | 
Nc-ivi | 
ScrFI I 
Aval | | 
BsaJI | | 
BsaJI | | | 
I I I I 



HaelV 
Hin4I 
Hinfl I 
Tfil | 

ScrFI | | Mnll 
Banll I 1 I TtHttil-.lI 1 1 \ 

Bspl286I | I I BciVI SfaNI* t I 

EcoRII I I I Sfel | Fokl ... Hpy^88I*X H It 

I I I I III till 

TCCAGGATTCCGTATCCAATCTACAGGGTAT^ 

AGGTCCT^GG®ATAGGT-TAGATGTC 



HphI I 

Cvi*R^^| | 

Mb6>II-| I I 

I I I I 



Hpyl88IX 
I 



Bsvii9i6>I%. 
Dde*I«< 
I 



FaU'I*?v Dpnl 
St?htt«32®|- SauSftl*. I 
Acivl- | | Hpy^88^Xi I 
I II III 



ACATCAGAAAATCTCCTTAGGTTTTGCACAGTTCTTCACCCGCACTAAAGAAATCGGATC 

2401 + (- + + + + 24 60 

TGTAGTCTTTTAGAGGAATCCAAAACGTGTCAAGAAGTGGGCGTGATTTCTTTAGCCTAG 



CviJI 
BsmAI | 
BsmBI I 
Tthlllll | 
Maell I I 

Alwl | I I 

I I 1 



Taal 
MboII I 
I I 



CjePI 
Earl | 
BsmAI | | 
I I I 



BsaJI 
BstDSI NlalV 
Alul IDrdlll 
CviJI | Mnll] 
II II 



AAGCAACAACGTCTCGGCTCACAATACAGTCTCTTCACTTTATGTTGAGCTTCCGTGGTT 

2461 + + + + + + 2520 

TTCGTTGTTGCAGAGCCGAGTGTTATGTCAGAGAAGTGAAATACAACTCGAAGGCACCAA 



CviRI 
CjePI I 
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Bed 
NlalV 
Avail I 
CviJI Sau96l| 
Taal RleAI I HphI | I I I BsmFI 

t Mill 
CCAAGAGGCCTTTGCAACATCCCACAGTTTAGCGTATGGCTATGGGGACCATCACCTCCA 

2521 + + + + + + 2580 

GGTTCTCCGGAAACGTTGTAGGGTGTCAAATCGCATACCGATACCCCTGGTAGTGGAGGT 



CviJI 
Hael 
Haelll 
StuI 
Fokl | 
I 



2581 



Maelll 

Tsp45I Afllll BsmFI Fnu4HI 

Mnll |Hpyl78III Maell Cvi JI | Tsel ! 

Ill I II II 

CGCCTACATCCGTCACATCAAGAACAGGGCAGAAGGGACGTGTTATAGCCATACATTAGC 

+ + + + + + 2640 

GCGGATGTAGGCAGTGTAGTTCTTGTCCCGTCTTCCCTGCACAATATCGGTATGTAATCG 



Alul 
CviJI 
Fnu4HI | 
Tsel | I 
I I I 



Bbvl 
CviJI | 
Bbvl | I 
I I I 



131 

o 



Mnll 
BscGI 
CviJI 
BbvCI | 

BsaJI HphI BpulOI I 

Styl MboII | Ddel | 

I II II 

AGCAGCTATCGGCTGTTCTTTCCCTTGGCAACAGAAATCCTATCTTCACCTCAGCCCGTT 

2641 + + + + + + 2700 

TCGTCGATAGCCGACAAGAAAGGGAACCGTTGTCTTTAGGATAGAAGTGGAGTCGGGCAA 

Cjel 
MboII 

... Maelll 

Tsp4 5I 
Tthlllll 
NspV I 
TaqI I 
Earl I I 



Muni Cjel 
Tsp509I Maell 
BseMII I HphI | 

Sthl32I I (CviRI I I 



I I I 



=1= 



CGTTCAGGCAATTGCAATACGTTCTCACCAAACAGCGTTCGAAGAGATTGGTGACAATCC 



111 



2701 



-+ 2760 



GCAAGTCCGTTAACGTTATGCAAGAGTGGTTTGTCGCAAGCTTCTCTAACCACTGTTAGG 

BsaJI 
Styl 
Mnll 
Hinfl | 
Tfil I 

BsmAI CviJI Hpyl88IX Hin4I Bfal | I 

I II I I III 

CCGAAAGTTTGTCTCTCAAAAGCCTTTCTATAATCTGACCTTACCTCTAGGAATCCAAGG 



Sthl32I 
HphI I 



2761 



+ 2820 



GGCTTTCAAACAGAGAGTTTTCGGAAAGATATTAGACTGGAATGGAGATCCTTAGGTTCC 



Apol 
Tsp509I 
I 



Sfcl 
Rsal | 
BsaAI I I 
Maelll I IPlel 
I I I I I 



Hpyl78III 
Bfal I 
Xbal I I 
Hinfl I I I 
I III 



Mspl 
BsaWI | 
BsrFI 1 
PinAI | 
I I 



2821 



AAAATGGCAGTCAAAATTCCACGTACCTACAGAATGGACTCTAGAACTTTCTTACCAACC 

+ + + + + + 2880 

T T TT AC C G T CAG T TTTAAGGTGCAT G GAT G TC T TAC CT GAGAT CT T GAAAGAAT G G T TG G 
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ScrFI 
BsaJI 
EcoRII 
NlalV 
Ppil 
AciT 
BsrBI 
Cac8I 1 

Rsal BslX^. I Mnll J I 

| II 111 



Maelll 
Tsp45I 
BslI**,- I 



2881 



..+_„„-—+ 2940 



CviJI 
Hael 
Haelll 
Bfal 
Avrll | 
BsaJI | 
Styll 
EcoRV 1 I 



MslI BsrDI Rsal MboII 

. . . I II I 

GGATATCCTAGGCCATAACTATGTTCGCAATGCTTTAGGGTACAAAGTCCACAATCAAAC 



2941 



3000 



CCTATAGGATCCGGTATTGATACAAGCGTTACGAAATCCCATGTTTCAGGTGTTAGTTTG 



Hhal,, 
I 



Dprvl* 
Sau3A^ | 
Hp^l7 8^I| I 
Earvl*. t I I 

SapjL,,. Ta\q-I^ 1 
lit 



BseRI 
BslI 

Sai>3AIW 
BseR*I*r 
BsaJI I I 
Styl I I 
TaqI* * I I I 
I I II 



Bs a -I . 
Bsm^-lA.: 
AlwW j Taq>I 
I 1 I 




3061 



Apol 
Tsp509I 
Tthlllll I 
Msel | | 

Bed Rsal I I I Tsp509I 

. . I I lit I 

ATCTACGCACCATCTCCAAGCAGGAAGTACCTTAAAATTCTAAAATAAAAGAACGATAAA 



Mnll 
Mnll I 



CviJI 



Tsp509I I HpylB8TX- 
t I I 



Altf^* I 
CviJI I 
I I 
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Mnll I Hinfl 
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ATTGAAATCTTTAGAATTAACAACTATCCGATGAGCTACGTTAGCCCAATCGGTAGAGGA 

3121 + + + + + + 3180 

TAACTTTAGAAATCTTAATTGTTGATAGGCTACTCGATGCAATCGGGTTAGCCATCTCCT 

Dral 
Mnll 
Swal 
Msel I 
Apol I I 
Tsp509I I I 

I I I 
CTCCCTCAAAATTTAAATAA 

3181 + + 3200 

GAGGGAGTTTTAAATTTATT 



O 




» 
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